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I.  INTRODUCTION 


Reported  below  are  the  results  of  a  study  carried  out  at  the  Center  for  Communications 
and  Signal  Processing  Research  at  the  New  Jersey  Institute  of  Technology  between  May 
1,  1993  and  July  31,  1994.  This  research  is  a  continuation  of  previous  research  performed 
during  similar  periods  in  1991/92  [1]  and  1992/93  [2].  The  aim  of  the  research  is  to  further 
widen  the  investigation  of  the  bootstrapped  algorithm  to  possible  applications  solving  real 
system  problems. 

Historically,  the  idea  of  the  algorithm  as  a  way  for  canceling  interferences  was  first  pro¬ 
posed  by  the  principle  investigator  in  1981  [3],  and  later  it  was  used  for  canceling  cross¬ 
polarization  in  satellite  communication  [4]  and  in  the  microwave  terrestrial  radio  link  [5  - 

7]. 

The  bootstrapped  interference  canceler  is  principally  composed  of  two  separate  cancelers, 
each  using  the  output  of  the  other  canceler  as  its  reference  (desired  signal)  input.  In  fact, 
such  a  structure  performs  as  a  “Signal  Separator”  rather  than  an  interference  canceler.  Since 
for  its  operation  there  is  no  need  for  a  reference  signal,  it  is  sometimes  justifiably  called  a 
“Blind  Separator.” 

Three  different  structures  were  proposed  in  [1];  namely,  the  Backward/ Backward  (BB), 
the  Forward/Forward  (FF)  and  the  Forward/ Backward  (FB).  They  are  depicted  in  Figures 
1,2,  and  3.  The  different  weights  of  these  two-input /two-output  separators  can  be  controlled 
by  minimizing  the  output  power  or  minimizing  the  absolute  value  of  the  cross- correlation 
between  the  two  outputs. 

During  the  1991/92  phase  of  the  research  [1],  the  steady  state  behavior  of  these  separa¬ 
tors  was  evaluated.  The  effect  of  additive  noise  on  the  performance  of  the  separator  was  also 
examined.  It  was  shown  that  extension  of  all  three  structures  to  multi-input /multi-output  is 
possible.  Therefore,  it  was  expected  that  applications  to  multi-user  Code  Division  Multiple 
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Access  (CDMA)  or  neural  networks  is  feasible.  Only  some  preliminary  work  on  the  appli¬ 
cability  of  the  algorithms  was  performed  during  this  phase  of  the  research  and  some  results 
were  presented.  In  particular,  emphasis  was  put  on  the  use  of  the  three  structures  in  han¬ 
dling  a  dually-polarized  signal.  Error  probabilities  of  the  bootstrapped  cross-pol  cancelers 
for  M-ary  QAM  signals  were  examined.  Their  performance  was  compared  with  other  kinds 
of  cancelers  known  in  the  literature. 

It  was  concluded  at  this  phase  of  the  research  that  the  bootstrapped  algorithms  have 
many  useful  properties  which  make  them  excellent  candidates  for  use  as  signal  separators 
or  interference  cancelers  when  other  algorithms  have  difficulties.  In  some  cases,  they  clearly 
outperform  other  algorithms.  In  particular,  it  was  demonstrated  that  the  algorithm  has  the 
property  of  converging  to  its  steady  state,  where  signal  separation  occurs,  much  faster  than 
other  algorithms.  Unlike  other  algorithms,  the  speed  of  convergence  does  not  depend  on 
the  signals’  power  ratios  and,  hence,  does  not  depend  on  the  eigenvalue  spread  of  the  input 
correlation  matrix. 

The  1992/93  phase  of  the  research  [2]  was  aimed  toward  further  analysis  and  evaluation  of 
the  performance  of  these  algorithms.  Inherently,  the  three  structures  exhibit  different  levels 
of  complexity,  particularly  when  used  in  very  high  frequencies,  as  in  microwave  communica¬ 
tions.  They  also  present  different  delay  paths  to  signals  and,  hence,  have  different  system 
bandwidths.  Performance  studies  in  this  regard  show  the  existing  complexity-bandwidth 
trade  off.  The  question  of  stability  of  the  steady  state  was  also  examined.  Two  ways  of 
real-time  realization  were  proposed.  One  uses  orthogonal  perturbation  sequences,  while  the 
other  implements  weight  dithering  with  PN  sequences. 

Further  work  on  delay  controlled  structures,  which  began  in  1991/92,  was  continued 
in  1992/93.  The  work  showed,  in  particular,  that  such  a  structure  performs  simultane¬ 
ous  spatial  separation  and  direction-of-arrival  estimation  of  wideband  sources.  With  multi¬ 
input/multi-output  applications,  we  obtained  new  results  in  blind  separation  of  signals  when 
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the  environment  is  dispersive. 

Some  preliminary  results  were  also  obtained  in  applying  these  algorithms  to  separation 
of  co-channel  CDMA  signals,  as  well  as  blind  channel  equalization. 

Besides  some  issues  of  implementation,  during  the  current  phase  of  research  (93/94) 
we  concern  ourselves  with  two  separate  directions  of  application,  separation  of  multi-user 
CDMA  signals  and  blind  equalization  of  dispersive  communication  channels.  Some  neu¬ 
ral  network-type  convergence  studies  were  also  performed.  For  each  of  these  applications, 
system  structures  and  block  diagrams  are  suggested.  Simulations  are  performed  to  prove 
applicability  of  the  systems  proposed.  Some  software  modules  were  prepared  to  integrate 
the  software  in  the  Signal  Processing  Workstation  environment. 

Section  (2),  below,  is  a  technical  summary  of  the  study  performed  during  this  research 
and  its  results.  Detailed  reports  on  which  this  summary  is  based  are  given  in  the  appendices 
of  the  document.  Each  appendix  covers  a  specific  part  of  the  research  and  is  written  in  a 
way  that  can  be  read  independently  of  the  other  parts.  Section  (3)  contains  the  conclusions 
and  recommendations  for  further  studies. 
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II.  TECHNICAL  SUMMARY 


2.1  Issues  Concerning  the  Implementation  of  the  Algorithms 

The  multi-input,  multi-output  structure  of  the  bootstrap  algorithm  was  considered  in  [1] 
(Appendices  E  and  F).  In  Appendix  G  of  [2]  the  performance  of  the  backward/forward 
structure  when  it  separates  multi-signal  composites  in  a  multi-channel  dispersive  environ¬ 
ment,  is  discussed.  Further  results  for  implementing  these  algorithms  are  given  in  Appendix 
A  of  this  report. 

In  the  first  part,  entitled,  “Fast  Decorrelation  Algorithms  for  Signal  Separation”  the  for¬ 
ward/forward  structure  of  the  bootstrapped  algorithm,  termed  “decorrelation  algorithm,”  is 
suggested  for  the  separation  of  an  unknown  linear  combination  of  source  signals.  The  Steep¬ 
est  Descent  (LMS)  and  Recursive  Least  Square  (RLS)  versions  of  the  algorithm  are  consid¬ 
ered.  The  results  are  compared  with  those  of  the  conventional  LMS  algorithm,  wherein,  in 
contrast  to  the  decorrelation  algorithm,  known  references  of  the  signal  are  assumed  avail¬ 
able.  In  particular,  it  is  proved  that  the  decorrelation  algorithm  exhibits  a  smaller  eigenvalue 
spread  and  hence  converges  faster  than  the  conventional  LMS.  In  this  study,  the  observation 
Xm{t)  is  taken  as  a  linear  combination  of  the  input  sources,  such  that, 

N  , 

^m(f)  =  ^  ^  y  ^nhn{t)  +  (l) 

i  n=l 

for  1  <  m  <  A.  ^  is  an  information  symbol  bit  of  the  n-th  source  at  the  i-th 

time  interval.  The  information  symbols  are  assumed  to  be  statistically  independent  and 
equiprobable.  is  the  received  pulse  signature  associated  with  the  n-th  source.  The 

energy  of  hn{t)  is  normalized  to  1,  i.e.,  f^‘  hn(t)dt  =  1  and  is  the  energy  of  the  received 
signal  from  the  n-th  source,  assumed  orthogonal.  Cross-correlations  between  the  sequences 
are  included  in  the  matrix  A.  v{t)  is  (possibly  correlated)  additive  Gaussian  noise.  It  is 
assumed  that  A  is  strictly  diagonal  dominant,  i.e.,  |  |>  E  I  \  with  this  assumption  A 

is  invertible.  The  coupling  fixed  parameters  a^n  ai'e  riot  known.  The  received  signals  after 
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match  filtering  and  sampling  may  be  represented  by  the  vectors; 

x  =  AEb  +  v,  (2) 

where  A  is  the  mixture  matrix,  E  is  a  diagonal  matrix,  diagE  =  >  "s/^]  >  ^ 

[foi, . . . ,  bN^  is  the  vector  of  information  bits.  The  time  superscript  has  been  dropped  since 
the  decision  on  6^*^  requires  observation  of  signals  during  that  interval  only,  thus  b  represents 
the  vector  of  information  bits  at  any  time.  The  vector  v  consists  of  independent  samples 
of  zero-mean  Gaussian  noise  with  the  covariance  matrix  E  vv^  =  <t^G,  where  G  is  the 
known  matrix  of  the  noise  cross- correlations  between  the  outputs  of  the  matched  filter. 

Clearly,  the  minimum  square  error  criterion  leads  to  the  Wiener  solution  for  the  weight. 
The  corresponding  algorithm  for  controlling  the  weight  is  given  by 

Wn{k)  =  (I  -  ^x{k)x^{k))wn{k  -  1)  +  ^bn{k)x{k),  (3) 

where  bn{k)  =  sgnyn(^)  fh.®  decision  at  the  n-th  output  yk- 

It  was  also  shown  that  the  criterion  for  output  decorrelation  can  be  obtained  by  mini- 
mizing  the  cost  J  =  where  =  E[6x  ]  under  the  constraint  that  =  1.  The 

corresponding  algorithm  for  controlling  the  weight  is  given  by 

w„(A:)  =  1- B{k)x'^{k)^Wn{k  -  1),  (4) 

where  B  =  {I  -  u„u^)6  and  is  a  unit  vector  with  the  n-th  element  equal  to  1.  Erom 
comparing  (3)  and  (4)  it  was  possible  to  deduce  that  the  eigenvalue  spread  of  the  first 
algorithm  is  larger  than  that  of  the  second.  Simulation  results  of  learning  curves  are  given 
in  Fig.  1  and  Fig.  2  of  Appendix  A,  part  1.  Shown  in  Fig.  3  and  Fig.  4  is  an  estimate  of 
the  convergence  region  using  a  7  factor  defined  by  7  =  1  —Pcjlpd-,  where  pcf  and  pd  are  the 
final  and  initial  probabilities  of  error. 

In  conclusion,  this  study  showed  that  the  LMS  decorrelator  algorithm  is  faster  than  the 
LMS  error  algorithm  due  to  a  smaller  eigenvalue  spread.  We  introduced  a  new  RLS-type 
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algorithm  for  decorrelation.  The  RLS  decorrelator  was  shown  to  be  faster  than  the  LMS-type 
algorithms  and  of  comparable  speed  with  the  conventional  RLS  error.  Regions  of  convergence 
for  the  LMS  decorrelator  and  the  RLS  algorithms  were  shown  to  be  wider  than  those  of  the 
LMS  error  algorithm. 

In  the  second  part  of  Appendix  A  entitled  “Adaptive  Detectors  for  Multi-Channel  Signals 
in  Code  Division  Multiple  Access  System,”  both  the  feedforward  and  feedback  structure  are 
considered  for  implementation  in  CDMA  systems.  However,  an  assumption  was  used  that 
at  the  output  of  the  bank  of  matched  filters,  regularly  implemented  in  this  application,  the 
cross-correlation  matrix  A  =  A^j  is  unknown.  This  fact  might  be  cause  by  the  unknown  chan¬ 
nel  effect.  Nevertheless  this  fact  lead  to  a  problem  model  similar  to  that  of  the  previous  part 
of  the  appendix.  Four  adaptive  detection  criteria  are  considered:  two  each  for  the  forward 
structure  using  MSE  and  decorrelating.  The  other  implements  feedback  structures,  again 
with  MSE  and  decorrelating.  The  optimal  weights  for  all  these  algorithms  and  structures 
are  found.  Error  probability  is  used  as  a  performance  measure  for  comparison.  It  is  shown 
that  the  feedback  minimum  square  error  has  the  best  performance  in  terms  of  probability 
of  error.  This  might  be  due  to  the  fact  that  in  the  MSE  we  assume  the  availability  of  a 
reference  as  well  as  a  noise-free  case. 

2.2  Application  to  Signal  Separation  of  Multi-User  CDMA  Signals 

In  this  section  we  concentrate  on  signal  separation  of  a  multi-user  detector  for  CDMA  a 

system.  For  such  a  system,  the  receiver  signal  is  given  by 

k 

^(0  =  Z!  m  -  iT  -  Tk)  +  n(t),  (5) 

k=l  i 

where  bk{i)  E  {-1,  +1}  is  the  A;-th  user  data  bit  in  the  Ath  time  interval,  Sk{t),  k=l,  ...,K  is 
a  unit  energy  signature  waveform  of  duration  T  assigned  for  each  of  k  different  users.  n{t) 
is  a  zero-mean  white  Gaussian  noise  with  the  two-sided  power  spectral  density  No/2,  and 
Uk  and  Tk  are  the  received  energy  and  relative  delay  of  user  k.  The  energy  ak  is  assumed 
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unknown,  particularly  due  to  fading. 

If  r{t)  is  the  signal  received  at  the  main  station  (uplink)  then  ta,  might  be  different  for 
different  users.  Though  it  is  unknown,  sometimes  it  is  assumed  to  be  estimated  by  a  separate 
process.  If,  on  the  other  hand,  r{t)  is  the  received  signal  at  a  single-user  location  (downlink), 
then  Tk=0  for  every  k  and  the  signal  is  considered  synchronous. 

2.2.1  Adaptive  Separation  of  the  Synchronous  Multi-User  CDMA  Signal 

The  model  for  this  signal  separator  is  given  in  Figure  4.  Some  variation  to  this  scheme  will 
be  discussed  in  the  sequel.  Since  r/;=0,  k=l,  ...,  K  then  one  can  show  that  vector  x,  after 
the  matched  filters,  is  given  by; 

x{i)  =  PAh{i)  -f  n{i),  (6) 

where  x=[a;i, ...,  CCA'],  b=[6i, . . . ,  6^-],  A  =  dia.g{y/a^, . . . ,  y/aJc)  and  n=[ni, . . . ,  ua:].  The 
kj—th  element  of  the  symmetric  cross-correlation  matrix  P  is  defined  as 


Pk]  =  /  Sk{t)sj{t)dt  A:,j  e  (1,2,...AA 

JO 

(7) 

fT 

Uj  =  /  Sj(t)n{t)dt 

Jo 

> 

II 

(8) 

The  covariance  matrix  of  the  zero  Gaussian  noise  vector  n  is  given  by  (Ao/2)P. 

Multiplication  by  P~^  (assumed  known)  leaves  vector  z  having  uncorrelated  signals 

z  =  Ab  +  ,^,  (9) 

where  ^  —  P“^n. 

At  the  canceler  output  we  generate  an  output  by  multiplying  a  function  of  Zi  by 
weights  Wjk  and  subtract  the  result  from  Xk-  Therefore,  in  vector  notation 

Y  =  X  -  W^/(z),  (10) 
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where 


0 

W12  ■  ■ 

■  ■  WiK 

W2\ 

0  .. 

■  ■  W2K 

WKl 

WK2  ■ 

■  ■  wkk 

The  steepest  descent  algorithm  is  used  to  minimize  the  output  signal  power  E{yl): 


Wfc 


(11) 


where  is  the  A;-th  column  vector  of  W. 

Note  that  since  2;^,  j  =  l,  ...  K  k)  contains,  besides  the  data  bj,  a  noise  component 
the  error  performance  at  the  output  will  depend  strongly  on  SNR’s  at  Zj.  This  is  exactly 
what  is  known  as  power  inversion  behavior  inherent  in  LMS  and  minimum  pole  cancelers. 
To  improve  performance  sgnzj  is  used  instead  of  Zj.  Particularly  when  SNR  at  Zj  for  all  j 
are  high,  the  canceler  will  perform  at  its  best. 

In  Appendix  B,  Part  I,  we  analyze  the  separator  depicted  in  Figure  4;  we  derive  the 
optimal  weights  and  then  use  them  to  calculate  the  probability  of  at  the  A;-th  output  as 
a  function  of  cross-correlation  vector  pk  is  a  column  vector  obtained  from  P  by  deleting  the 
element  pkk- 

For  the  two-user  case,  we  show  in  Figure  5  the  probability  of  error  of  user  1  as  a  function 
of  SNR2  —  SNRi  with  p=0.7  and  5A^Pi=8dB.  For  comparison  the  probability  of  error  at 
the  output  of  decorrelator  is  also  shown.  As  expected  for  high  SNR2  the  estimate  of  62 
from  Z2  is  almost  perfect  and  the  canceler  performs  at  its  best  (ideal  separator).  For  a  very 
low  SNR2  —  SNRi  the  error  at  yi  is  better  due  to  low  interference  at  x,  the  output  of  the 
matched  filter. 

Figure  6  depicts  the  error  probabilities  when  all  (K-1)  users  except  user  1  maintain  the 
same  SNRi.  Gold  codes  are  implemented  in  this  figure  and  K  was  taken  to  be  2  to  5. 
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2.2.2  Adaptive  Separator  of  a  Synchronous  Multi-User  CDMA  Signal 
with  Soft  Decision 


As  stated  in  the  previous  section,  the  signal  separator  of  Figure  5  performs  better  at 
high  SNR^-  SNRi  and  at  low  SNRi  -  SNRi-  Using  a  hard  decision  (Signum  function)  at 
low  SNRi  is  not  advised,  due  to  a  higher  error  in  estimating  the  interference.  This  suggests 
using  a  soft  decision  instead.  With  this  we  have 

yk  =  Xk~  w;;hic,  (12) 

where 


I  zi/\ik  I  Zi  |<  m 

1  sgn{zi)  otherwise. 


(13) 


The  value  of  \ik  is  determined  heuristically  from  observed  values  of  decorrelator  outputs 


A/fc  =  pik 


(£^{l  l})^ 


1}  •  ^  ^ 
In  Appendix  B,  Part  II  we  analyze  this  separator,  we  derive  the  optimal  weights  and  use 
them  to  calculate  the  probability  of  pek  at  the  A:-th  output.  In  Figure  7  we  show  the  prob¬ 


ability  of  error  of  user  1  as  a  function  of  SNR2  —  SNRi  with  p  —  0.7,  SNRi  —  8dB  for 
the  two-user  case.  More  results  are  also  shown  m  this  appendix.  For  comparison  we  also 


add  to  this  figure  the  performance  of  the  decorrelation  and  the  canceler  with  a  hard  limiter. 
It  is  easy  to  recognize  that  using  a  soft  limiter  improves  performance  for  almost  all  SNR2. 
Particularly  noticeable  is  the  improvement  at  a  low  SNR2  value,  where  the  hard  limiter 


results  in  poor  performance. 
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2.2.3  Adaptive  Separator  of  Synchronous  Multi-User  CDMA  Signals 
Using  the  Bootstrapped  Decorrelating  Algorithm 

Here,  instead  of  using  an  algorithm  that  minimizes  the  output  signal  power  E{yl)  (see 
equation  11),  we  use  a  steepest  descent  algorithm,  which  simultaneously  reduces  the  absolute 
value  of  the  correlation  between  the  output  yk  and  the  decision  on  all  other  outputs,  he., 

yE{ynSgn{yk)].  (15) 

This  is  what  we  referred  to  in  the  previous  report  as  the  forward/forward  bootstrapped 
algorithm.  In  Figure  8  we  depict  the  block  diagram  of  this  separator.  For  obtaining  the 
optimal  weight  one  has  to  solve  a  system  of  nonlinear  equations.  However,  if  we  assume  the 
signal-to-noise  ratio  at  the  output  is  such  that  the  main  contribution  to  the  output  error  is 
due  to  multiuser  interference,  then  we  can  use  the  following  approximation: 

E{yksgnyk)  ^  E{ykhk){l  -  2Pek),  (16) 

where  Pek  is  a  diagonal  {K  —  1)  x  {K  —  1)  matrix  whose  elements  are  Pej,  k  =  1)  2,  ... 
K  —  1,  K  +  1,K.  This  approximation  changes  the  nonlinear  system  of  the  equation  to  a 
tractable  linear  system.  In  Appendix  B,  Part  HI  we  present  analysis  of  this  separator,  we 
derive  the  optimal  weights,  then  we  calculate  the  probability  of  error.  In  Figure  9  we  show 
the  results  of  this  separator  for  the  two-user  case. 

2.2.4  One-Shot  Adaptive  Separator  of  an  Asynchronous  Multi-User  CDMA 
Signal 

Different  from  the  synchronous  separator,  some  or  all  Tk  in  (5)  are  non-zero.  However,  we 
will  assume  them  to  be  known.  Without  loss  of  generality  we  concentrate  on  bit  0  of  user  1, 
and  assume  that  0  =  ti  <  T2  <  . . .  <  Tk  <  T.  The  sampled  output  of  the  matched  filter  for 
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user  1  is  then 


K 

x-i(O)  =  ^/h76i(0)  +  ^  v^[/9fci6fc(-l)  +  PiMO)]  +  m(0). 

A:=l 

The  normalized  partial  cross-correlations  pki  and  for  k  =  2  ,  K  are 

T 

Pki  —  (  si{t)sk{t  +  T  —  Tk)dt 

Jo 

rT 

Pik  =  /  Si{t)Sk{t  -  Tk)dt, 

Jo 


(17) 

(18) 
(19) 


and 

ni(0)  =  /  n{t)sT,{t)dt 
Jo 

is  a  zero-mean  Gaussian  random  variable  with  a  variance  of  No/2.  In  matrix  notation: 


xi(0)  =  v^6i(0)  +  pf  Abi(O)  -f  ni(0),  (20) 

where  A  =  » 

0  A 

and  Pi=  [  P2ir”  d2i)  Pi2)  •••)  Pik  ]•  The  separator  forms  an  estimate  of  the  multiuser  interfere 
as  the  weighted  vector  of  tentative  decisions  on  symbols  that  interfere  with  &i(0)  directly, 
and  subtracts  it  from  the  matched  filter  output. 

Except  for  the  large  dimension  of  the  weighted  vector,  the  canceler  section  is  the  same  as 
in  the  synchronous  case.  Obviously,  depending  on  t^,  the  vector  pk  will  change  the  relative 
powers  in  the  current  and  previous  bits  of  the  interferers  and  so  the  performance  of  the 
canceler. 


,  Ai  =  diag  [>!,.. -v^],  b2(0)=  [b2(-l),  ...,  b,(-l),  b2(0),  ...  bj,(0)] 


In  Appendix  B,  Part  IV  we  present,  some  analysis  and  results,  primarily  with  two  users 
of  this  separator.  Figure  10  shows  the  probability  of  error  of  use  1  versus  SNRi  for  the  two 
asynchronous  users.  The  relative  energy  is  defined  as  /.J  sl{t)dt.  For  comparison  we  add  to 
this  figure  the  conventional  single-user  detector  and  the  single-user  bound  when  interference 
is  absent. 
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2.2.5  Adaptive  Bootstrap  Decorrelator  for  Multi-User  CDMA  in  an  Asynchronous 
Unknown  Channel 

In  any  of  the  previously  discussed  adaptive  separators,  the  principle  idea  is  that  of  using 
a  bank  of  matched  filters  at  the  input  of  the  receiver.  If  the  users  are  synchronous  and 
the  codes  are  known,  then  besides  noise,  the  outputs  of  the  filters  contain  a  known  mix  of 
the  different  users’  bits.  This  mixture  depends  on  the  cross-correlation  of  the  codes.  Linear 
transformation  on  these  outputs  with  the  inverse  of  the  cross- correlation  matrix  will  generate 
outputs  that  contain  a  separate  signal  (decorrelator).  The  separation  is  inadequate  because 
of  the  high  noise  contamination.  A  tentative  decision  followed  by  an  adaptive  canceler  did 
the  job  of  reducing  the  interference  at  the  output  of  the  matched  filter  without  having  high 
noise. 

If  on  the  other  hand  the  users  are  asynchronous  but  the  relative  delays  and  the  codes  are 
known,  then  the  one-shot  approach  will  again  generate  a  known  mixture  at  the  outputs  of 
the  bank  of  the  matched  filter,  which  corresponds  to  each  user  bit  0  (for  example)  and  bit 
(0)  and  bit  (-1)  of  other  users.  Again,  linear  transformation  by  the  inverse  of  this  generalized 
cross-correlation  matrix  will  result  in  an  output  with  uncorrelated  signals.  Then,  as  before, 
a  tentative  decision  followed  by  an  adaptive  canceler  will  do  the  job. 

For  the  asynchronous  unknown  relative  delay  the  mixture  at  the  outputs  of  the 
matched  filters  are  unknown.  This  is  also  the  case  if  r^’s  are  estimated  by  some  process,  then 
there  will  be  some  estimation  error.  For  these  cases  we  suggest  using  the  bootstrapped  algo¬ 
rithm  for  decorrelating  the  outputs  of  the  matched  filters.  Such  a  decorrelation  is  proposed 
in  Appendix  B,  Part  V. 

The  principle  operation  of  this  decorrelator  (see  Figure  11)  is  weighting  the  outputs  of  the 
matched  filter  by  a  matrix  W,  the  same  as  in  (10),  while  the  next  and  previous  bit  by  upper- 
and  lower-triangular  parts  of  the  other  matrix  W,  that  is,  a  total  of  2(K-1)  weights  for  each 
user  output  (total  of  2K(K-1))  weights.  The  adaptive  algorithm  then  controls  the  weight 


matrices  W  and  W  to  reduce  the  absolute  value  of  the  correlation  between  the  decorrelation 
output  ym{i)  and  all  other  outputs  of  the  decorrelator  (after  decision)  at  time  r,  i-1  and  i+1 
respectivel}^  The  weights  Wmk  and  w'^nk  for  1  j  K,  m  <  K,  a  /  m  are  controlled. 

Wmk  ^  Wmk  + yE{ym{i)sgn{yk{i))) 

Wmk  <-  Wmk  +  yE{ym{i)sgn{yk{i  -  1)))  k>m 

Wmk  ^  Wmk  + gE{ym{i)sgn{yk{i  +  l)))  k  <  m.  (21) 

Note  that  for  the  two-user  case  K=2,  there  are  4  weights,  wi2,  W21,  Wu  and  W21. 

In  Appendix  B,  Part  V  we  present  a  more  detailed  analysis  of  this  decorrelator.  Par¬ 
ticularly  for  the  two-user  case,  we  found  the  optimal  weight  and  use  it  to  derive  the  error 
probability  of  user  1  at  the  output  of  the  decorrelator.  This  appendix  also  contains  some 
simulations  and  analytical  results,  and  compares  them  to  the  results  obtained  with  the  one- 
shot,  when  Tfc  is  known. 

2.2.6  Convergence  and  Stability  Analysis  of  the  Adaptive  Separators 

In  Appendix  B,  Part  VI  we  consider  the  question  of  convergence  of  the  weight  control 
algorithm  to  the  optimal  value.  Although  this  is  done  for  the  synchronous  case  when  the 
algorithm  minimizes  the  output  power,  the  derivation  can  be  extended  to  the  decorrelation 
algorithm  with  appropriate  linearization.  Extension  includes  the  possibility  of  the  one-shot 
asynchronous  case.  This  study  concludes  with  a  condition  on  the  learning  rate  g. 

In  summary,  we  presented  in  this  section  two  approaches  for  adaptive  separation  of  multi¬ 
user  CDMA  signals,  and  we  show  their  performance  in  terms  of  error  jjrobability.  All  these 
separators  are  blind  cancelers,  since  they  only  use  the  received  signal  without  the  help  of 
training  signals.  Some  of  the  algorithms  used  a  fast  algorithm;  others  are  minimum  power 
algorithms  whose  performance  is  enhanced  with  tentative  predecisions. 


2.3  Blind  Equalization  of  Dispersive  Communication  Channels 


In  digital  communication  the  need  for  blind  equalization  arises  when  the  channel  is  un¬ 
known  to  the  receiver,  which  attempts  to  estimate  a  sequence  of  transmitted  data  without 
resorting  to  the  use  of  a  training  sequence.  The  main  problem  of  blind  equalization  is  that  of 
finding  an  appropriate  cost  function  (or  equivalent  error  function)  that  reflects  the  amount 
of  intersymbol  interference  (ISI)  introduced  by  the  channel,  and  which  does  not  involve  the 
transmitted  symbol.  Optimization  of  the  cost  function  should  lead  to  minimization  of  the 
ISI.  Many  researchers  dealt  with  blind  equalization  and  the  ill  convergence  problem,  which 
was  treated  with  modiflcations  suitable  for  the  case,  but  with  extra  complexity. 

The  main  emphasis  of  their  research  is  on  the  linear  equalizer  structure.  Another  very 
important  structure  in  digital  communication,  the  decision  feedback  equalizer,  receives  less 
attention. 

As  part  of  our  current  research  effort,  we  suggest,  implement  and  study  of  the  boot¬ 
strapped  decorrelation  algorithm  in  conjunction  with  decision  feedback  equalization.  An¬ 
other  topic  of  blind  equalization  considered  in  this  phase,  although  not  related  to  the  decor¬ 
relation  algorithm,  is  the  “Anchored  Blind  Equalizer  using  the  Constant  Modulus  Algorithm” 
for  improving  performance. 

These  two  topics  belong  to  a  larger  class  of  symbol-by-symbol  equalization.  The  second 
topic  involves  maximum  likelihood  sequence  estimation  (MLSE),  where  the  Viterbi  algorithm 
(VA)  is  used  to  search  a  trellis  for  the  desired  sequence.  The  blind  maximum  likelihood 
(ML)  sequence  estimation  problem  is  discussed  in  section  2.3.3  and  the  corresponding  part 
of  Appendix  C.  Here  the  linear  dispersive  channel  model  is  assumed  unknown,  but  it  has 
quantized  parameters.  A  channel  trellis  and  a  data  trellis  are  defined  to  search  for  the  ML 
channel  and  data  estimates  using  the  Viterbi  algorithm.  The  approach  provides  a  good 
performance/complexity  tradeoff. 


In  contrast  to  symbol- by- symbol  estimation,  maximum  likelihood  sequence  estimation 
has  a  lower  error  rate  at  the  expense  of  complexity.  The  complexity  of  the  VA  grows  ex¬ 
ponentially  with  the  length  of  the  channel  response.  When  the  impulse  response  becomes 
larger,  the  VA  becomes  impractical,  and  methods  for  complexity  reduction  are  needed.  In 
section  2.3.4  and  in  the  corresponding  part  of  Appendix  C,  we  present  a  new  method  of 
complexity  reduction  of  such  an  equalizer. 

2.3.1.  Blind  Decision  Feedback  Equalization  Based  on  Decorrelation 

The  structure  of  such  an  equalizer  is  shown  in  Figure  12.  Ihe  cascade  of  the  transmit  channel 
and  receiver  filters  is  modeled  as  an  FIR  filter  with  impulse  response 

h{n)  =  1  +  XI 

1  =  1 

where  ^(•)  is  the  kronecker  delta.  It  is  also  assumed  that  the  input  h  is  a  binary  white 
sequence  with  a  zero  mean.  The  output  of  the  channel  is  thus  given  by 

Xk  =  h  +  Y1 

i=l 

Clearly  the  post-cursors  (/ji, /i2)  •••?  ^iv)  introduce  ISI.  The  estimated  data  is  produced 
by  passing  Ak  through  a  sheer.  The  input  to  the  sheer  of  the  decision  feedback  equalizer  Ak 
is  given  by 

Ak  =  Xk  —  A-T-i  W 

=  AT-iW,  (24) 

A  A  A  A 

where  Afc_i  is  the  vector  of  the  past  N  decisions  A'k-i=[Ak-i,Ak-i,  ■  ■  .,Ak-N]  (the  prime 
stands  for  transpose)  and  Ifc_i  is  the  vector  of  past  transmitted  information  bits  Tk-i  =  [ 
Ik-i,Ik-2,  -  ■  •  ,h-N  ],  where  h-i  G  {  -1,  1  }  and  P{Ik-i  =  1}  =  P{/a;-i  =  -1}  =  1/2.  W 
and  H  are  the  equalizer  and  channel  parameter  vectors,  respectively;  W’  =  [  rci,  tC2,  ■  ■  • ,  wn] 
and  H’  =  [/ii,  /12,  ■  ■ . ,  hj^]. 


b'S 


A  decorrelation  algorithm  which  uses  AkAk-i  as  an  error  function  adaptively  controls  the 
equalizer  weight  according  to 

+  liA^,  (25) 

In  a  practical  implementation  one  would  replace  the  expectation  by  the  current  realiza¬ 
tion,  leading  to  a  stochastic  difference  equation. 

For  ideal  ISI  cancellation,  the  sheer’s  input  Ak  =  4,  and  therefore  sequence  {Ak}  will  be 
decorrelated,  f.e.,  AkAk-i  =0  for  n=0.  In  other  words,  decorrelation  is  a  necessary  condition 
for  ideal  cancellation  of  ISI.  In  order  to  be  able  to  use  the  decorrelation  of  the  sheer’s  input  as 
a  criterion  for  controlling  the  feedback  weight  vector  W,  one  must  prove  that  decorrelation 
is  also  sufficient  for  cancelling  ISI.  This  is  done  in  detail  in  Appendix  A,  Part  I.  Simulation 
performed  on  different  channel  models  showed  convergence  to  the  desired  values  of  weights, 
and  hence  ISI  cancellation.  It  was  also  shown  that  the  decorrelation  algorithm  converges  to 
the  correct  weight  irrespective  of  the  initial  condition. 

Upper  bounds  and  lower  bounds  on  the  probability  of  bit  error  for  the  AWGN  case  with 
a  zero  mean  and  a  variance  of  cr^,  was  found.  A  channel  whose  transfer  function  is  given  by 
H(2:~^)  =  1  =  0.8  z~^  +  0.6  +0.4  z“^,  is  depicted  in  Figure  13,  these  bounds  together  with 

results  of  simulation  of  this  channel  and  the  no  ISI  probability  of  error  are  shown  for  compar¬ 
ison.  Due  to  the  simplicity  of  the  decorrelation  algorithm,  one  could  derive  a  Kalman-type 
decorrelation  blind  equalizer,  which  increases  the  convergence  substantially.  This  is  included 
in  Appendix  C,  Part  I. 

2.3.2  Anchored  Blind  Equalization  Using  the  Constant  Modulus  Algorithm 

As  stated  earlier,  most  algorithms  for  the  blind  equalizer  in  the  literature  use  a  non-convex 
cost  function  that  possesses  local  minima  to  which  the  equalizer  may  converge.  Some  of  these 
equilibria  may  be  undesirable,  i.e.,  they  will  not  be  able  to  remove  ISI.  For  these  algorithms. 
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equalizer  initialization  becomes  an  important  issue.  Verdu  showed  that  anchoring  (setting 
the  first  coefficient  to  one)  will  result  in  global  convergence.  Verdu  used  the  minimum  energy 
(ME)  as  a  cost  function.  An  important  algorithm  developed  by  Triechler  is  the  constant 
modulus  algorithm  (CMA)  which,  like  the  others,  suffers  from  ill- convergence.  In  Appendix 
C,  Part  II,  we  suggest  an  anchored  constant  modulus  algorithm  (ACMA)  for  the  linear  and 
decision  feedback  blind  equalizers  (see  Figure  14,  for  example). 

Detailed  analysis  of  this  algorithm  is  given  in  this  appendix  for  equalizing  auto-regressive 
channels  and  moving  average-type  channels.  It  is  shown  analytically  and  through  simulation 
that  the  algorithm  converges  successfully  if  the  unknown  channel  gain  exceeds  a  certain  value 
(l/-\/3).  The  controlled  weight  will  fail  to  converge  to  the  desired  value  if  the  channel  gain 
drops  below  this  value.  This  problem  can  be  minimized  if  we  introduce  a  gain  in  front  of  the 
equalizer.  In  comparing  the  ACMA  to  that  of  AMEA  we  notice  that  the  former  converges 
faster  than  the  latter.  With  AMEA  we  notice  that  the  former  converges  faster  than  the 
latter  (see  Figure  16).  The  AMEA,  however  is  showing  no  ill  convergence  for  a  small  g,  but 
like  the  ACMA  rate  of  convergence,  it  decreases  with  decreasing  g. 

2.3.3  Blind  Maximum  Likelihood  Sequence  Estimation 

For  sequence  estimation  of  digital  data  in  the  presence  of  intersymbol  interference,  both  the 
data  and  the  channel  might  be  considered  unknown.  To  facilitate  the  proposed  approach 
of  this  research,  we  assumed  a  quantized  channel.  This  is  justified  in  practice  since  a  finite 
precision  processor  is  used  to  implement  the  algorithm.  Two  trellises  are  developed;  one 
for  the  channel  and  the  other  for  the  data.  The  VA  is  used  to  search  for  the  ML  channel 
and  data  sequence  estimates.  The  output  of  one  trellis  is  fed  into  the  metric  calculator  of 
the  other.  The  resulting  scheme  offers  a  considerable  reduction  in  computational  complexity 
compared  with  other  methods  available  in  the  literature,  and  prevails  with  regard  to  the 
complexity / performance  tradeoff. 
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The  sampled  output  of  the  channel,  r*.  at  instant  A;,  is  given  by 


rit  =  rT^h  +  nfc,  (26) 

Ifc  =  [h,  h-ii  ■  ■  •  1 1k-i]l  ,  h  —  [  /io,  Ail,  ■  •  • ,  Ain]/,  and  {hi  is  the  sampled  impulse  response 
of  the  cascaded  transmit  channel  and  receiver  filters  are  assumed  to  be  slowly  time  varying. 
{Ik-i}  is  the  sequence  of  transmitted  symbols;  assumed  to  be  iid  random  variables  whose 
values  are  {±1,  ±3, . . . ,  ±(Af  —  1)}  with  equal  probability,  and  {uk}  is  an  additive  white 
Gaussian  noise  sequence. 

For  the  blind  equalization  problem  at  hand,  we  consider  the  conditional  probability  of 
the  received  sequence  r  given  the  transmitted  seciuence  and  the  channel  impulse  response. 
With  all  channel  realizations  equally  probable,  the  joint  ML  estimate  for  the  data  sequence 
1ml  and  the  channel  parameter  Ha/l  is  given  by 

{Iml.I^ml)  =  argrnax/r|i,h(r|I,h),  (27) 

wherein  the  maximization  is  carried  over  all  possible  h  and  I.  /r|i,h('  I  ’, ')  conditional 
probability  density  function  (pdf)  of  the  received  sequence  r.  Such  a  problem  is  not  trivial 
since  h  is  continuous  and  I  is  discrete. 

The  channel  parameters  are  approximated  by  discrete  values  from  the  infinite  alphabet 
{0,  ±c,  ±2c, . . .  },  where  c  can  be  chosen  as  arbitrarily  small.  The  corresponding  channel 
trellis  will  have  an  infinite  number  of  states.  However,  since  the  channel  vector  h  does  not 
vary  over  each  signal  interval,  we  need  not  consider  all  its  possible  levels  at  a  given  instant. 
We  proposed  a  simple  assignment  scheme  for  the  channel  trellis.  The  next  channel  estimate 
h«+i,  ]g  given  by 

=  h'  for  state  0,  and 
=  h*±cl,i  for  state  n  =  1, 2, . . . , 

where  1„  is  a  vector  of  length  L,  whose  elements  are  either  O’s  of  I’s. 
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A  smaller  number  of  states  can  be  used  if  the  vector  1„  is  restricted  to  be  all  zero,  except 
for  the  element  at  the  n-th  location  to  unity.  This  assignment  results  in  T  +  2  states.  There 
are  other  schemes,  up  to  2^+^  states,  but  the  above  assignment  will  result  in  a  simple  trellis. 

In  summary,  the  algorithm  would  proceed  as  follows: 

1.  Start  with  initial  channel  estimate,  h°, 

2.  Use  the  VA  to  solve  for  Iml  =  argmaxi/(r|I,  Iiml), 

3.  Use  the  VA  to  solve  for  Hml  =  argmaxh /(r|lML,  hML), 

4.  Iterate  2  and  3. 

The  algorithm  achieves  the  ML  estimate  of  the  channel  by  incrementing  or  decrementing 
the  previous  estimate.  In  each  transition  only  one  channel  parameter  is  changed.  To  improve 
the  speed  of  convergence  one  can  include  more  states  in  the  channel  trellis,  which  allows 
for  changing  two  or  more  parameters  at  a  time,  at  the  expense  of  complexity.  Thus,  one 
can  compromise  the  rate  of  convergence  for  complexity.  The  step  parameter  c  affects  the 
performance.  Choosing  a  smaller  c  will  reduce  the  rate  of  convergence,  but  it  will  improve 
the  error  rate. 

The  algorithm  described  above  was  used  to  equalize  the  channel  (assumed  unknown) 
whose  sampled  impulse  response  is  given  by 

h{n)  =  0.405(5(n)  +  0.817(5(n  -  1)  +  0.407(5(n  -  2). 

For  simplicity,  binary  data  is  assumed.  The  channel  and  data  trellises,  each  having  four 
states,  are  shown  in  Figure  15.  Clearly,  using  smaller  c  results  in  higher  complexity  but 
better  performance. 
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2.3.4  Reduced  Complexity  Sequence  Estimation  Using  State  Partitioning 

As  was  stated  earlier,  the  maximum  likelihood  sequence  estimator  has  a  lower  error  rate  at 
the  expense  of  complexity.  The  complexity  of  the  VA  grows  very  fast  with  the  length  of 
the  channel  and  or  the  size  of  the  constellation.  Many  researchers  have  been  considering 
the  problem  of  reducing  state  complexity  and  came  out  with  different  proposals.  Some  of 
these  are  useful  for  systems  utilizing  a  larger  signal  constellation,  while  others  are  suitable 
for  channels  with  a  long  impulse  response.  We  proposed  a  state  reduction  method  which 
is  based  on  a  state  partitioning  technique,  which  can  handle  both  a  large  constellation  and 
long  impulse  response  channels.  Moreover,  it  is  richer  in  the  possible  number  of  reduced 
states  (not  only  power  of  M,  where  M  is  the  alphabet  size).  Therefore,  it  enables  a  more 
flexible  complexity /performance  tradeoff. 

The  discrete  time  channel  model  considered  here  is  given  in  Figure  16.  This  model  arises 
in  a  quadrature  amplitude  modulation  (QAM)  system  at  the  output  of  a  sampled,  whitened 
matched  filter.  The  channel  h[D)  is  modeled  as  a  finite  response  filter  (FIR),  and  n{D)  is  a 
white  Gaussian  noise  source  with  a  zero  mean  and  variance  of  cr^.  The  data  sequence  a(U) 
consists  of  symbols  which  are  independent  and  identically  distributed.  We  will  assume 
binary  transmission  here. 

The  data  symbols  ak  take  values  of  ±1  with  equal  probability.  Referring  to  Figure  17, 
the  output  y{D)  is  given  by  y{D)  =  a{D)h{D)  +  n{D),  where  h{D),  given  by  h{D)  = 
ho  +  hiD  + . . .  +  hmD'^,  defines  the  channel  impulse  response,  whose  degree  is  determined  by 
the  channel  memory.  The  state  of  the  system,  s{k)^  at  time  k  is  given  by  {ak-i,ak-2-,  •,  Ofc-m)- 
Denote  the  set  of  all  2™  possible  states  by  £]  then 

£  =  {s,-  :  Si  is  a  state  of  the  system,  i  =  0, 1, . . . ,  2’"  —  1). 

In  the  proposed  method,  the  set  £  is  partitioned  into  N,  Si,  partitions,  where  N  is  2  <  N  < 
2™,  such  that 

1.  US'  Si  =  m, 
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2.  Si  n  Sj  =  0;  the  empty  set  for  i  /  j  and  0  <  i,j  <  N  —  1, 

3.  The  partitions  Si  are  chosen  such  that  for  all  Sn{k)  €  Si,  the  corresponding  next  state 
Sn{k  +  1)  must  belong  to  the  same  partition. 

In  particular,  the  third  condition  contains  the  partition  in  a  way  that  enables  a  trellis  to  be 
defined.  A  procedure  was  devised  in  which  the  partitions  satisfy  the  third  rule.  Detailed 
information  is  given  in  Appendix  C,  Part  IV. 

In  this  appendix  we  show  how  the  method  can  be  applied  to  obtain  trellises  with  2® 
states  as  well  as  trellises  with  the  number  of  states  not  2b  Bounds  on  the  error  probability 
of  these  cases  are  also  found.  Applications  to  non-binary  (M-PAM,  M-QAM  and  MPSK) 
modulation  are  also  discussed.  Simulations  were  performed  and  some  results  are  depicted  in 
this  appendix.  The  channel  impulse  response  used  was 

h{n)  =  0.7107h(n)  -F  0.1421^(n  -  1)  +  0.2l326{n  -  2)  +  0.1421h(n  -  3)  +  0.6396h(n  -  4). 

We  present  in  Figure  17  the  simulation  result  and  upper  bound  for  different  numbers  of 
states.  As  expected  with  12-  and  14-state  trellises,  the  probability  of  error  is  better  than  in 
the  8-state  case. 

III.  CONCLUSIONS  AND  RECOMMENDATIONS 

During  this  period  of  research  we  made  significant  progress  in  applying  the  bootstrap 
algorithm,  particularly  that  based  on  decorrelation,  to  two  important  area:  separation  of 
multi-user  CDMA  signals  and  equalization  of  dispersive  communication  channels. 

We  laid  the  ground  work  for  the  adaptive  separator  of  a  synchronous  channel  and  exam¬ 
ined  the  effect  of  predecisions,  hard  or  soft  limiters,  on  the  performance.  We  also  examined 
the  use  of  the  decorrelator  for  this  separator.  For  the  asynchronous  case  we  presented  the 
adaptive  separator  based  on  the  one-shot  approach,  which  turns  k  asynchronous  users  with 
known  relative  delay  into  2K—1  synchronous  users.  If  the  relative  delays  are  unknown  then 
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this  approach  can  not  be  used.  Instead  a  special  arrangement  of  the  decorrelator  algorithm 
is  possible.  Some  preliminary  results  were  obtained  using  this  approach.  A  convergence  and 
stability  analysis  was  also  performed,  and  conditions  were  established  for  the  synchronous 
separator  convergence  case. 

Symbol-by-symbol  blind  channel  equalization  was  also  considered.  Using  the  decorrela¬ 
tion  algorithm,  inter-symbol  interference  was  reduced.  Particularly,  for  the  first  time,  for  the 
decision  feedback  structure.  The  approach  proves  to  achieve  global  convergence  in  contrast 
to  other  methods.  Some  results  on  anchoring  the  constant  modulus  were  also  obtained. 

For  blind  maximum  likelihood  sequence  estimation  we  introduced  the  idea  of  quantiz¬ 
ing  the  channel  response  to  be  able  to  perform  an  estimation  of  the  data  sequence  when 
the  channel  is  unknown.  A  method  of  using  two  simultaneous  trellises,  one  for  data  and 
one  for  channel  response,  was  introduced  and  shown  to  result  in  a  considerable  reduction 
of  computational  complexity  and  it  prevails  with  regards  to  the  complexity/performance 
tradeoff. 

We  then  introduced  a  method  of  reduced  complexity  sequence  estimation  which  can  be 
used  for  both  a  long  channel  response  and  a  large  constellation.  It  also  provides  more  freedom 
in  number  of  reduced  states  and  gives  more  flexibility  in  design. 

In  summary,  during  this  period  of  research,  aside  from  adding  information  on  issues  of 
implementation,  we  have  a  vast  amount  of  results  in  the  two  main  topics  of  applications.  A 
list  of  pubications  resulting  from  this  research  is  given  in  Section  V. 

The  work  carried  out  led  us  to  make  a  number  of  recommendations  for  further  study. 

1.  Adaptive  threshold  level  for  the  separators  of  multi-user  CDMA  signals 
with  soft  decisions: 

It  was  claimed  in  section  2.2.2  that  soft  decision  improves  performance  of  the  syn¬ 
chronous  separator,  particularly  at  a  region  with  a  low  interference  level.  The  threshold 
level  of  the  soft  decision  was  determined  heuristically  from  the  observed  values  of  the 
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decorrelator  outputs.  We  suggest  to  search  for  optimum  threshold  (adaptive)  to  obtain 
the  best  performance  possible  with  predecisions,  and  hence  obtain  the  best  separator 
of  this  kind. 

2.  Separator  of  asynchronous  multi-user  CDMA  signals  with  uncertainty  in 
relative  delays; 

Some  preliminary  results  were  obtained  in  section  2.2.4.  However,  even  with  the  one- 
shot  approach,  the  relative  delays  obtained  from  some  measurement  may  contain 
some  uncertainty.  In  this  regard: 

•  One  may  want  to  examine  the  effects  of  uncertainty  on  the  performance  of  the 
separator, 

•  Suggest  an  adaptive  structure  to  reduce  the  effect  of  this  uncertainty, 

•  Examine  the  possibility  of  using  this  adaptive  structure  for  handling  the  asyn¬ 
chronous  multi-user  case  with  unknown  relative  delays,  and  avoid  the  need  for 
separate  estimators  for  finding  these  delays. 

Preliminary  estimation  of  these  problems  suggest  that  the  bootstrapped  algorithm  can 
be  advantageous  in  solving  them. 

3.  Effect  of  channel  fading  and  multi-path  on  the  separator’s  performance: 

All  work  previously  done  assumed  that  the  channel  was  unfaded  and  slowly  varying. 
In  reality,  channels  are  stochastic  and  suffer  from  fading.  Models  for  these  channels 
are  needed  to  examine  their  effects  on  performance.  This  is  an  important  topic  for 
extending  current  results. 

4.  Multi-Carrier,  multi-user,  CDMA  separator: 

Especially  for  a  multi-path  channel,  a  complex  RAKE  receiver  was  proposed.  Multi¬ 
carrier  CDMA  turns  the  wideband  CDMA  full  channel  into  many  narrow  band  chan- 
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nels.  Therefore,  multi-path  becomes  a  flat  fade  of  the  different  narrow-band  channels, 
and  hence  it  illuminates  the  need  for  a  RAKE  receiver.  Again  some  preliminary  results 
were  obtained  with  this  structure.  The  bootstrap  decorrelating  algorithm  was  proven 
to  be  crucial  in  its  implementation. 

5.  Further  work  on  equalization  using  the  decorrelation  algorithm: 

For  most  practical  communication  channels,  the  response  is  an  FIR  minimum  or  no 
minimum  phase  with  a  post-  and  pre-cursor.  In  section  2.3.1  we  modeled  the  channel 
as  FIR  with  only  post-cursors  and  showed  that  the  blind  decision  feedback  equalizer 
can  be  used  successfully  with  the  decorrelation  algorithm  for  the  linear  equalizer,  with 
or  without  a  pre-cursor  remained  open.  More  work  in  this  direction  is  needed. 

6.  SPW  Modules: 

Further  implemention  the  results  of  this  report  with  modules  of  the  signal  processing 
workstation  (SPW)  are  needed. 
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Figure  4  Adaptive  Separator  of  Synchronous  Multi-User  CDMA  Signals 
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Figure  5  Error  Probability  of  User  1  for  K  =  2,  and  SNRi  =  8  dB  and  pi2 
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Figure  6  Error  probability  of  User  1  for  k—2  to  k=5 


Figure  7  Effect  of  Soft  Limiter  on  Adaptive  Separator. 
Error  probability  of  User  1  for  K  =  2,  and  SNRi  =  8  dB  and  pu 


Figure  8  Adaptive  Separator  of  Synchronous  Multiuser  CDMA  Signal  Using  the 

Bootstrapped  Decorrelating  Algorithm 


Figure  9  Performance  of  Separator  using  Decorrelating  Algorithm. 
Probability  of  Error  for  User  1  (K=2,  SNRi  =  8  dB,  pi2  =  0.7) 
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Figure  10  Performance  of  One-Shot  Separator  of  Asynchronous  Signals. 
Error  Probability  of  user  1  for  K=2,  pi2  =  0.2,  p2\  —  0.6,  ei  =  0.4 
(1)  a2/«i  =  0.6  (2)  ai/a2  =  0.6 


Figure  11  Proposed  Bootstrapped  Decorrelator  for  Asynchronous  Unknown  Channel 
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Figure  13  Performance  of  Decision  Feedback  Equalizer  with  Decorrelation  Control. 


Figure  14  The  Anchored  DFE  using  the  CMA 
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Figure  16  Discrete  Channel  model 


Figure  17  Performance  of  Reduced  Complexity  Sequence  Estimate 
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APPENDIX  A:  PART  I 


FAST  DECORRELATION  ALGORITHMS  FOR  SIGNAL  SEPARATION 


by 

Alex  Haimovich,  R.  Manzo  and  Yeheskel  Bar-Ness 

ABSTRACT 


LMS  and  RLS  type  algorithms  are  snggested  for  decorrelation  of  multi-channel  systems 
outputs.  These  algorithms  act  as  signal  separators  when  applied  to  unknown  linear  combi¬ 
nations  of  the  inputs.  The  performance  of  the  suggested  algorithms  is  compared  with  that 
of  the  conventional  LMS  and  RLS  algorithms  that  minimize  the  mean  square  error.  It  is 
shown  that  the  correlation  matrix  eigenvalue  spread  associated  with  the  LMS  decorrelator  is 
always  smaller  than  the  eigenvalue  spread  corresponding  to  the  conventional  LMS,  resulting 
in  faster  convergence  speed  for  the  decorrelator.  A  new  RLS  type  decorrelator  algorithm 
is  suggested.  The  RLS  decorrelator  is  shown  to  be  faster  than  the  LMS  decorrelator,  not 
affected  by  the  eigenvalue  spread,  and  comparable  in  speed  with  the  conventional  RLS  al¬ 
gorithm.  Convergence  analysis  by  simulation  shows  that  the  RLS  algorithms  and  the  LMS 
decorrelator  have  wider  regions  of  convergence  than  the  conventional  LMS. 

I.  INTRODUCTION 

The  separation  problem  addressed  in  this  paper  is  the  recovery  of  unknown  independent 
sources  from  observations  of  a  linear  mixture  of  the  sources.  We  are  concerned  with  a 
multi-channel  input  multi-channel  output  system  where  each  output  is  an  unknown  linear 
combination  of  the  inputs.  This  type  of  problem  arises  in  numerous  applications  such  as 
canceling  cross-polarization  interference  in  dual  polarized  systems,  detection  in  multiple 
access  communications  systems,  and  separating  multiple  speech  signals,  to  name  a  few. 

One  approach  to  signal  separation  is  to  view  it  as  an  interference  cancellation  problem. 
The  signal-to-noise-and-interference  ratio  at  each  output  channel  is  enhanced  by  suppressing 
the  co-channel  interference.  When  a  reference  signal  is  available,  the  mean  square  error 
(MSE)  between  the  output  and  the  reference  signal  is  minimized  by  the  classical  Wiener 
filter,  which  can  be  implemented  using  steepest  descent  algorithms  such  as  the  LMS  and 
RLS.  For  example,  the  LMS  algorithm,  operating  as  an  interference  canceler,  has  been 
suggested  as  a  separator  of  cross-polarized  signals  [1]. 

Begining  with  the  early  eighties  [2]  a  class  of  signal  separators  have  been  proposed  that 
in  effect  estimates  the  parameters  of  the  mixtnre  between  the  sources.  A  detailed  steady 
state  analysis  of  three  cross-coupled  “noise  cancelers”  structure  can  be  found  in  [3].  This 
research  was  aimed  mainly  for  communication  application  such  as  satellite  and  digital  mi¬ 
crowave  dually  polarized  links.  It  assumed  that  the  signals  to  be  separated  are  uncorrelated 
and  showed  separation,  provided  that  “some  knowledge”  is  available  by  which  the  signal  can 
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discriminated.  Later  the  same  separator  structures  were  suggested  for  application  to  neural 
networks  [4].  Their  algorithm  for  separation,  however,  assumed  the  signals  to  be  statistically 
independent  and  therefore  the  recovered  signals  should  be  statistically  independent.  While 
it  is  difficult  to  devise  criteria  for  testing  statistical  independence,  it  is  possible  to  design 
cost  functions  for  decorrelation  between  outputs.  Independence  cannot  be  achieved  directly, 
but  it  can  be  approached  by  minimizing  higher  order  cross-moments  by  way  of  gradient 
stochastic  algorithms  with  costs  defined  by  non-linear  functions  of  the  outputs  [5].  These 
signal  separators  are  derived  from  criteria  different  than  those  of  the  interference  cancelers, 
furthermore,  they  do  not  require  a  reference  signal,  hence  belong  to  the  class  of  blind  separa¬ 
tors.  Their  operation  suggests  a  bootstrap  process  where  cross-correlations  of  the  outputs, 
or  of  functions  of  the  outputs,  are  used  to  control  feedforward,  or  feedback,  filter  weights. 
Simulations  have  shown  that  LMS  decorrelating  algorithms  are  faster  than  LMS  algorithms 
minimizing  the  mean  square  error  [6]. 

In  this  paper  we  compare  various  signal  separation  algorithms  based  on  the  MSE  and 
decorrelation  criteria.  The  model  and  examples  reflect  a  multi-user  communication  system 
such  as  CDMA.  Each  user  transmits  an  assigned  unique  waveform  modulated  by  BPSK 
data.  The  waveforms  are  assumed  orthogonal  and  the  mixture  of  the  signals  arises  due  to 
the  channel  effects  such  as  crosstalk  and  dispersion.  The  main  contributions  of  the  paper 
are:  (1)  prove  that  the  eigenvalue  spread  associated  with  the  LMS  decorrelating  algorithm  is 
smaller  than  the  spread  associated  with  the  LMS,  thus  providing  an  explanation  for  results 
previously  reported  regarding  the  speed  of  convergence  of  the  LMS  decorrelating  algorithm, 
(2)  introduce  a  new  decorrelating  RLS  type  algorithm. 

The  system  model  is  presented  in  Section  .  In  Section  we  define  separation  criteria  and 
based  on  them  compute  expressions  for  the  steady  state  weight  vectors.  Adaptive  algorithms 
and  the  con 

rgence  analysis  of  the  LMS  decorrelator  are  developed  in  Section  .  In  Section  the  various 
algorithms  are  compared  by  computer  simulations.  Conclusions  are  provided  in  Section  . 

II.  PROBLEM  STATEMENT 


Consider  the  communication  of  N  sources  through  N  channels.  We  make  the  following 
basic  assumptions: 


1.  The  sources  are  independent. 

2.  The  outputs  of  the  channels  consist  of  a  linear  mixture  of  the  inputs. 

3.  Number  of  inputs  is  equal  to  the  number  of  sources. 


The  observation  signals  (t)  are  linear  combinations  of  the  input  sources,  such  that: 


Xr 


(0 


N 


n=l 


(t)  +  I^{t) 


(1) 


for  1  <  m  <  A.  G  {  —  1, 1}  is  the  n-th  information  symbol  bit  at  the  z-th  time  interval. 
The  information  symbols  are  assumed  to  be  statistically  independent  and  equiprobable. 
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hn  {t)  is  the  received  pulse  signature  associated  with  the  n-th  user.  The  energy  of  {t)  is 
nOTmalized  to  1,  i.e.  K  (t)d^  =  1,  and  in  is  the  energy  of  the  received  signal  from  the 
n-th  user.  Cross-correlations  between  the  sequences  are  included  in  the  matrix  A.  v  (t)  is 
(possibly  correlated)  additive  Gaussian  noise.  The  coupling  fixed  parameters  amn  are  not 
known.  The  received  signals  after  match  filtering  and  sampling  may  be  represented  by  the 
vectors: 

x  =  AEb-t-v  (2) 

where  A  is  the  mixture  matrix,  E  is  a  diagonal  matrix,  diag  E  =  ' ' '  ?  \/6v]  ,  and 

h  =  [bi,  •  •  • ,  is  the  vector  of  information  bits.  The  time  superscript  has  been  dropped 

since  the  decision  on  requires  observation  of  signals  during  that  interval  only,  thus  b 
represents  the  vector  of  information  bits  at  any  time.  The  vector  v  consists  of  independent 
sample 

mean  Gaussian  noise  with  covariance  matrix  E  vv^  =  where  G  is  the  known  matrix 
of  the  noise  cross-correlations  between  the  outputs  of  the  match  filters. 

III.  SIGNAL  SEPARATION  CRITERIA 

Two  signal  separation  criteria  are  considered:  the  Mean  Square  Error  (MSE)  separation 
and  the  signal  decorrelation  criterion.  The  criteria  are  used  to  develop  control  algorithms 
for  the  network  weights.  We  will  consider  separation  of  the  n-th  user  signal  from  the  rest  of 
the  signals,  other  channels  being  treated  similarly.  To  yield  the  n-th  user  ou 
put  the  input  vector  x  is  applied  to  the  adaptive  filter  represented  by  the  A  x  1  col¬ 
umn  vector  w„.  For  BPSK  transmission  a  sign  test  can  be  used  for  detecting  the  data  bit: 
bn  =  sgnyn- 

3.1  MSE  Separation 

The  MSE  signal  separator  minimizes  the  mean  squared  error  between  its  output  and  a 
reference  signal.  Typically  the  reference  is  initially  supplied  by  a  training  signal.  When  the 
adaptive  weights  converged  and  the  errors  with  respect  to  the  training  signal  are  small,  the 
detector  is  switched  to  operate  in  decision  directed  mode,  with  the  reference  signal  being 
supplied  by  the  estimated  symbol.  The  MSE  signal  separator  is  in  effect  the  optimum  linear 
detector  for  the  n-th  user.  It  acts  as  a  canceler  for  the  co-channel  interference  from  the  other 
users  and  minimizes  the  error  between  the  reference  signal  and  the  output  y-n-  lu  decision 
directed  mode  the  reference  signal  is  supplied  by  the  estimated  data  symbol,  hence  we  have 
e  (k)  =  bn  {k)-yn  {k)  ■  The  MSE  is  defined  e  =  E  [e^  {k)] .  In  the  following  we  drop  the  explicit 
time  dependency  since  signals  are  assumed  stationary.  The  steady  state  weight  vector  for 
separating  the  n-th  user  signal  is  given  by  the  well  known  Wiener-Hopf  equation. 


where  R^  =  E  xx^  is  the  input  correlation  matrix,  and  ^  =  E  6„x  is  the  cross¬ 


correlation  between  the  input  and  the  estimated  symbol.  We  will  show  that  in  the  absence 
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of  noise  the  MSE  criterion  indeed  leads  to  signal  separation.  When  the  input  vector  x  is 
given  by  the  model  in  equation  2,  the  input  correlation  matrix  can  be  written 


=  AEe[66^]eA^ 

AE^A^ 


The  cross-correlation  vector  is 


=  E  K{AEh  +  iy) 
=  AEu„ 


(5) 


where  u„  is  a  unit  vector  with  the  n-th  element  equal  to  1.  To  make  the  analysis  mathemat¬ 
ically  tractable  it  is  assumed  that  in  the  decision  directed  mode  correct  decisions  are  made 
with  high  probability  such  that  E  and  in  general  E  bb  =1,  where  I  is  the 

identity  matrix.  From  ,  (4)  and  (5)  the  weight  vector  w„  is  equal, 

w„=  A"^E“^u„  (6) 


The  information  bit  is  recovered  when  this  weight  vector  is  applied  to  the  noiseless  data: 
yn  =  wjx  =  u^E-’-A^^AEb  = 

3.2  Decorrelation  Signal  Separation 

The  decorrelation  separator  is  different  from  the  MSE  separator.  The  MSE  weight  vector 
for  any  user  is  controlled  by  the  error  derived  solely  from  the  output  corresponding  to  that 
user.  In  contrast,  the  decorrelator  weight  vector  is  controlled  by  all  the  channels’  outputs. 
Suggesting  an  operation  where  each  channel  is  helped  by  the  other  channels  this  structure 
has  been  referred  to  as  bootstrapped  [6].  The  decorrelator  seeks  to  decorrelate  the  output 
yn  from  a  set  of  reference  signals  which  represents  the  other  users’  symbols.  Operating  in  a 
mode  similar  to  the  decision  directed  mode,  decorrelation  is  achieved  when 

E  2/„b]  =  c„u„  (7) 

where  c„  is  a  constant.  When  condition  (7)  is  met,  is  decorrelated  from  all  components 
of  b  other  than  its  own  First  we  will  show  that  in  the  noiseless  case  this  criterion  indeed 
leads  to  signal  separation,  then  we  will  define  and  optimize  a  cost  function  to  compute  the 
steady  state  decorrelating  weight  vector. 

For  the  noiseless  case  we  have  E  2/nb  =  E  (wjAEb)  b  .  After  a  few  algebraic  manip¬ 
ulations,  we  get  that  (7)  is  equivalent  to  w^AEu„  =  c„u„,  for  1  <  n  <  A.  If  we  define  the 
matrices  W  =  [wi,  ■  •  • ,  w^-]  and  C  =  diag  [c] ,  •  •  • ,  c^] ,  then  the  weight  matrix  W  can  be 
written: 

W  =  c(a^E)“'  (8) 

Indeed,  when  this  weight  matrix  is  applied  to  the  input  x,  the  output  is  proportional  to  the 
data  vector:  y  =  W^x  =  W^AEb  =  Cb. 
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In  the  general  case  when  noise  is  present,  the  decorrelation  condition  in  (7)  can  be 
achieved  by  defining  the  cost  function  J  =  where  =  E  [bx^]  ,  and  minimizing 

it  with  respect  to  and  subject  to  the  condition  wju„  =  1.  The  condition  sets  Wnn  =  T 
such  that  the  n-th  component  of  the  input  is  transferred  directly  to  the  output.  Indeed 
the  minimum  of  the  cost  function  J  is  achieved  when  the  gradient  of  J  is  set  to  zero  and 

W.„.n  =1:  r 


E  Vnh 


which  is  equivalent  to  the  decorrelation  condition  (7).  The  steady  state  decorrelating  weight 
vector  is  given  by; 


where  the  scaling  factor  /?  =  is  set  to  meet  the  linear  constraint. 

3.3  Adaptive  Algorithms 


When  the  signal  environment  is  not  stationary  weight  vectors  for  the  MSE  and  the  decor¬ 
relator  can  be  calculated  using  steepest  descent  algorithms. 

3.3.1  MSE  Separator  Algorithms 

The  LMS  algorithm  for  updating  the  weight  vector  of  the  MSE  separator  is  given  by 

w„  (A:)  =  Wn{k  —  1)  +  fJ.e{k)yi{k) 

e{k)  =  6„(A:) -x^(fc)w„(/j  -  1) 

To  distinguish  it  from  the  LMS  decorrelator  we  refer  to  (11)  as  the  LMS  error  algorithm. 
The  convergence  analysis  of  the  LMS  algorithm  is  well  known.  The  necessary  and  sufficient 
condition  for  the  convergence  of  the  LMS  algorithm  is  that  the  convergence  factor  0  <  //  < 
2/A^ax(R^),  where  A^ax  (R^.)  is  the  largest  eigenvalue  of  the  covariance  matrix  R^.  It  is 
also  well  known  that  the  speed  of  the  LMS  algorithm  is  determined  by  eigenvalue  spread 

X  ~  Amax  (Ra;)  /  Amin  (Rx)  ■  •  •  i  a- 

The  RLS  algorithm  for  the  MSE  separator  is  based  on  a  recursive  implementation  ol 

relation  (3)  using  the  following  updates  of  the  covariance  matrix  R^,  and  cross-correlation 
vector  r-j  ^  : 

R^(A:)  =  (1 -a)R^(fc-l)  +  ax(fc)x^(fc)  ^2) 

=  (1  -  -  1)  +  ax(fc)6„(fc) 
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where  a  is  the  forgetting  factor.  From  (12)  and  (3)  we  get  the  following  recursion  relations: 

n  (h\  —  _ R-j:  _ 

^  ^  ’  i^+xT(A:)Rjhi:-l)x(A:) 

1)  (13) 

e{k)  =  q;6„(A:) -x^(A:)w„(^  -  1) 

w„(fc)  =  w„(A;-l)  +  g(A:)e(^) 

3.3.2  LMS  Decorrelator 

The  LMS  decorrelator  is  a  steepest  descent  algorithm  that  seeks  to  null  the  instantaneous 
estimate  of  the  gradient  of  the  cost  function  J.  From  (10)  we  have,  VJ  =  and  for 

the  instantaneous  estimate  VJ  {k)  =  b  {k)  (A;)  w„  {k)  =  [k)  w„  {k) .  According  to  our 

formulation  Wnn  =  1,  hence  to  preclude  adaptation  of  we  premultiply  the  gradient  by 
U„  =  which  zeros  its  n-th  component.  The  LMS  decorrelator  algorithm  is  given 

by: 

w„(A;)  =  w„(A:- 1) -y«y„(A;)U„b(A;) 

'^nn  —  1 

This  relation  can  also  be  interpreted  directly  as  seeking  to  minimize  the  instantaneous  corre¬ 
lation  between  the  output  of  the  separating  filter  for  the  n-th  source  and  the  other  detected 
sources. 

3.3.3  Convergence  Analysis  of  LMS  Decorrelator 

The  LMS  decorrelator  have  been  observed  to  converge  faster  than  the  conventional  LMS 
error  algorithm  [6].  In  this  part  we  will  prove  that  the  eigenvalue  spread  for  the  LMS 
decorrelator  is  smaller  than  for  the  LMS  error,  hence  the  former  has  faster  convergence 
speed 

First  we  determine  necessary  and  sufficient  conditions  for  convergence  of  the  LMS  decor¬ 
relator.  Define  B  =  —  u„u^j  b,  we  can  rewrite  (14) 


w„(A;)  =  w„  (A:  -  1)  - //?/„  (A:.)B  (A;) 

=  1  -  fiB  (k)  :>J  (k)  w„(A'-l) 


To  analyze  the  convergence  speed  we  will  estimate  the  eigenvalue  spread  for  the  LMS 
decorrelator  and  compare  it  with  the  spread  for  the  LMS  error.  Using  the  assumptions  stated 
previously  for  the  LMS  error  analysis,  we  can  write: 

E[w„(A;)]  =  1  E[w„(A:  -  1)1  ^  ^ 


and  used  the  relations 


F  ~  E  [wn  (A:  -  1)1 
I  -  E  [w„  (A:  -  1)1 


R^^  =  U„R^^  =  U„  E  [b  (b^EA^  +  v)]  =  U„  E  [bb^]  EA^'  =  U„EA^.  (17) 
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Without  loss  of  generality,  consider  the  separation  of  source  n  =  1,  from  the  other 
sources.  Define  the  following  matrix  and  vector  partitions:  ^  ~ 


1  :  0  \ 

...  ;  ...  ,B 


(  0  •  B'^  )  ’  ~  ^  write  equation 


V  0  ;  r  / 

(15)  as  follows: 


E  [W  {k)]  =  [l'  -  J  E  [W  (fc  -  1)]  -  /X  E  [b'  (^)  (k)  Wi  {k  -  1)]  (18) 

We  deduce  that  the  convergence  speed  of  the  LMS  decorrelator  is  controlled  by  the  spread 
of  the  eigenvalues  of 

Proposition  1:  The  following  relation  exists  between  the  largest  eigenvalues  ofKg,^  and 

^max  (R^a:)  —  i^g'x) 

Proof:  The  matrix  R^'^,  can  be  expressed  as  a  partition  of  the  matrix  R^^. 


Rr  =EA^  = 

OX 


V^^  ■■  rf 


Rp' 

±>  X 


Define  the  ratio 


p{z)  = 


According  to  the  Courant-Fisher  theorem  [7],  Amax  (R^j;)  —  P{^)-  Using  the  matrix 

partition  in  (20)  and  maximizing  p  (z)  over  the  restriction  Zi  =  0,  we  get 

Amax  (Rj^)  >  P  (^)  =  Amax  (Rfi'^)  (22) 


Proposition  2:  The  following  relation  exists  between  the  smallest  eigenvalues  ofRg,^  and 
^bx'- 

Amin  (Rfca;)  —  (Bg'j.)  (2^) 

Proof:  According  to  a  corollary  to  the  Courant-Fisher  theorem,  and  noting  that  R^,^  is 
the  (A  -  1)  X  (IV  -  1)  leading  principal  submatrix  of  R.j^,  we  have: 

Amin  (R^x)  —  (B-g'j.)  =  ^min  (R§'x)  (2^) 

Proposition  3:  If  the  noise  is  negligible  then  the  eigenvalues  o/R^,  are  equal  to  the  square 
of  the  eigenvalues  ofH-^^. 
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Proof:  We  have  the  relation  =  EA^,  and  for  negligible  noise  we  have  R2;=  AEEA^.  Consequently 
R^  =  RJ  R-^^.  It  follows  that  (R^)  =  A?  ^R-^^for  1  <  z  <  A.  In  particular, 


-^max  (Rx) 
Amin  (Ra;) 


(25) 


Proposition  f:  The  eigenvalue  spread  x(R'a:)  associated  with  the  LMS  error  algorithm,  is 
larger  than  the  spread  x  associated  with  the  LMS  decorrelator. 

Proof:  From  Propositions  1,2  and  3  we  have; 

X(R.)  =  X^(%J>X(%0  (26) 

Proposition  4  provides  the  explanation  why  the  LMS  decorrelator  algorithm  is  faster  than 
the  LMS  error  algorithm. 


3.4  RLS  Decorrelator 


We  introduce  a  new  decorrelating  RLS  type  algorithm.  Paralleling  the  development  of 
the  RLS  algorithm  from  the  MSE  weight  vector  in  (3),  we  developed  the  following  RLS 
algorithm  for  the  decorrelator  weight  vector  in  (10): 

_  Rp(k-i)Uk) 

^+x'^(k)Rp{k-i)h{k) 

=  Wn{k  -  1)  -  g{k)x^  {k)wn(k  -  1) 

_  Vn(fe) 

uWn(A) 

where  the  last  relation  is  designed  to  meet  the  linear  constraint  =  1.  The  four  algo¬ 

rithms  are  evaluated  in  the  next  section. 


g(^) 

(k) 

v„  (A;) 

w„  {k) 


IV.  PERFORMANCE  EVALUATION 


The  performance  of  separation  algorithms  was  evaluated  in  terms  of  the  probability  of 
error  for  detecting  the  transmitted  data.  Consider  the  probability  of  bit  error  for  the  n-th 
user.  Since  the  data  bit  can  take  on  one  of  two  equiprobable  values  the  total  probability  of 
error  is  equal  to  the  conditional  probability  P  [&„  >  0  |  =  -l]  .  The  probability  of  error 

at  the  output  of  a  separator  with  weight  vector  w„  is  given  by; 


R  = 


p 

bn>0 

1  — 

-1 

^1 

O 

A 

bji  — 

-1] 

p 

A 

s 

=  - 

-1 

{/>  [wjx 

> 

bn  —  “1 

2^- 

-N 

^be{- 

q( 

inn 

(j 

bn 

=-l 

\ 
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where  Q{z)  =  V dy,  tnj  is  the  j-th  element  of  the  vector  t„  =  w^AE,  and  the 

zero-mean  Gaussian  noise  was  assumed  white. 

The  convergence  of  the  algorithms  was  studied  by  simulations.  The  simulations  consisted 
of  a  A  =  4  users  system.  Without  loss  of  generality,  the  error  probability  was  computed  for 
the  first  user.  In  generating  the  signal  model  we  assumed  a  fixed  cross-correlation  between 
any  two  users  of  aij  =  0.15,  for  any  i  /  j.  In  the  following  figures  the  SNR  is  defined  with 
respect  to  the  first  user,  and  the  signal-to-signal  ratio  (SSR)  is  defined  as  the  ratio  of  the 
bit  energy  of  the  first  user  to  the  bit  energy  of  any  of  the  other  users.  Each  curve  is  the 
result  of  averaging  ten  independent  runs.  Figure  1  shows  the  average  learning  curves  of  the 
probability  of  error  for  SNR  =  8  dB  and  SSR  =  -  10  dB.  The  LMS  error  does  not  converge, 
and  the  RLS  error  and  RLS  decorrelator  converge  faster  than  the  LMS  decorrelator.  In 
Figure  2,  the  interference  power  is  reduced  to  SSR  =  -5  dB,  the  LMS  error  has  the  slowest 
convergence  while  the  other  three  algorithms  converge  at  comparable  speeds.  Comparison 
of  Figures  1  and  2  substantiates  our  claims  that  the  LMS  decorrelator  is  less  sensitive  to  the 
eigenvalue  spread  than  the  LMS  error. 

To  estimate  the  convergence  region  of  each  algorithm  we  took  the  approach  suggested  in 
[8].  A  figure  of  merit  7  is  defined  that  relates  the  initial  and  final  probabilities  of  error,  P^i, 
and  P.f,  as  follows: 

7  =  1-^  (27) 

when  Pef  <.  Pei,  7  1.  Note  that  7  =  0  corresponds  to  no  convergence  is  indicated  by 

Pgj  =  Pg,-.  Figure  3  shows  the  convergence  curves  for  SSR  =  .-10  dB.  The  SNR  is  varied  from 
-10  dB  to  +10  dB  to  result  in  the  initial  probabilities  of  error  indicated  on  the  abscissa.  The 
LMS  error  does  not  converge  for  these  parameters  while  the  LMS  decorrelator  and  the  RLS 
error  show  slightly  wider  regions  of  convergence  than  the  RLS  decorrelator.  In  Figure  4  the 
convergence  is  shown  for  SSR  =  -5  dB.  The  RLS  decorrelator  shows  the  poorest  convergence. 
The  LMS  decorrelator  and  the  RLS  error  have  wider  regions  of  convergence  than  the  LMS 
error. 

V.  CONCLUSIONS 

In  this  paper  we  showed  that  the  LMS  decorrelator  algorithm  is  faster  than  the  LMS  error 
algorithm  due  to  a  lesser  eigenvalue  spread.  We  introduced  a  new  RLS  type  algorithm  for 
decorrelation.  The  RLS  decorrelator  was  shown  to  be  faster  than  the  LMS  type  algorithms 
and  of  comparable  speed  with  the  conventional  RLS  error.  Regions  of  convergence  for  the 
LMS  decorrelator  and  the  RLS  algorithms  were  shown  to  be  wider  than  for  the  LMS  error 
algorithm. 
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Figure  1:  Learning  curve  of  the  probability  of  error  of  the  first  user  (SNR=8dB,SSR: 
10dB,N=4,p=0.15) 
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Figure  2:  Learning  curve  of  the  probability  of  error  of  the  first  user  (SNR=8dB,SSR 
5dB,N=4,p=0.15) 


Figure  3:  Convergence  regions  after  250  iterations  for  the  probability  of  error  of  the  first 
user  (SNR=8dB,SSR=-10dB,N=4,/9=0.15) 


Figure  4:  Convergence  regions  after  250  iterations  for  the  probability  of  error  of  the  first 
user  (SNR=8dB,SSR=-5dB,N=4,p=0.15) 


APPENDIX  A:  PART  II 


ADAPTIVE  DETECTORS  FOR  MULTI-CHANNEL  SIGNALS  IN  CODE 
DIVISION  MULTIPLE  ACCESS  SYSTEMS 

by 

Alex  Haimovich,  Ruth  Onn  and  Yeheskel  Bar-Ness 

ABSTRACT 


This  paper  considers  adaptive  receivers  for  multi-channel  cross-correlated  signals.  The 
specific  application  considered  is  the  separation  of  signals  in  code  division  multiple  access 
(CDMA)  systems,  but  the  methods  developed  are  applicable  to  other  similarly  modeled 
problems.  The  minimum  mean  square  error  (MSE)  and  the  signal  decorrelation  criteria 
are  applied  to  derive  feedforward  and  feedback  detector  structures.  It  is  shown  that  in  the 
noise-free  case,  and  at  steady  state,  these  detectors  indeed  perform  signal  separation.  The 
performance  of  these  adaptive  detectors  in  noisy  environment  is  compared  against  each  other 
and  to  the  ideal  decorrelator  detector.  The  Feedback  Minimum  Error  detector  is  found  to 
have  the  closest  performance  to  the  ideal  decorrelator. 

I.  INTRODUCTION 

This  paper  considers  adaptive  receivers  for  code-division  multiple-access  (CDMA)  signals. 
A  number  of  low  complexity  structures  have  been  proposed  by  Verdu  [1],  Aazhang  [2],  and 
others  to  solve  the  multi-user  detection  problem  with  little  degradation  vis-a-vis  the  optimal 
detector.  In  particular  the  decorrelating  detector  was  shown  to  be  near-far  resistant,  i.e., 
insensitive  to  the  power  of  the  interfering  signals. 

Most  of  the  strategies  for  multi-user  detection  exploit  the  knowledge  of  the  codes  assigned 
to  the  users  to  decorrelate  between  the  signals.  Furthermore,  it  is  customarily  assumed  that, 
following  match  filtering  in  each  channel,  the  cross-correlations  between  the  signals  are  known 
and  can  be  eliminated  by  the  fixed  decorrelating  receiver.  Nevertheless,  in  practice,  channel 
effects  often  cause  unknown  correlations  at  the  output  of  the  filters.  This  paper  addresses 
the  design  of  adaptive  receivers  for  the  separation  of  CDMA  signals.  The  work  has  been 
inspired  by  structures  previously  suggested  for  blind  signal  separation  and  referred  to  as 
bootstrap  algorithms  [3]. 

In  Section  II  we  define  the  system  model,  review  the  single  user  and  the  fixed  decorrelator 
detectors,  and  present  the  feedforward  and  feedback  configurations  for  adaptive  signal  sep¬ 
aration.  Four  adaptive  detection  criteria,  two  for  the  feedforward  and  two  for  the  feedback 
configurations,  are  presented  in  Section  III.  LMS  type  algorithms  are  suggested  for  each 
detector.  The  performance  of  the  adaptive  detectors  is  compared  by  simulations  in  Section 
IV.  Section  V  provides  the  conclusions. 
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IL  SYSTEM  MODEL 


Consider  an  Y-user  synchronous  CDMA  system.  We  make  the  following  basic  assump¬ 
tions: 

(1)  The  sources  are  independent. 

(2)  The  outputs  of  the  channels  consist  of  a  linear  mixture  of  the  inputs. 

(3)  Number  of  inputs  is  equal  to  the  number  of  sources. 

The  waveform  (code)  used  by  the  n-th  user  is  denoted  by  0  <  f  <  Tg,  where  Tg  is 

the  data  bit  duration.  This  waveform  is  used  to  modulate  each  of  the  n-th  user  information 
bits.  The  received  waveform  is  given  by  c„(f)  =  gn{t)  *  hn,{t),  where  hn{t)  is  the  n-th  channel 
(unknown)  impulse  response.  The  channel  causes  interchip  interference,  but  it  is  assumed 
that  the  intersymbol  interference  is  negligible.  The  received  signal  is  applied  to  a  bank  of 
filters,  which  are  matched  to  the  signature  waveforms  Sf„(t).  Due  to  the  channel  effects,  the 
outputs  of  the  match  filters  is  represented  by  an  unknown  correlation  matrix  A  =  {Aij}.  A 
is  normalized  such  that  =  1.  The  channel  response  is  assumed  not  to  vary  with  time; 
hence,  A  consists  of  fixed  elements.  It  is  also  assumed  that  A  is  strictly  diagonal  dominant, 
i.e.,  \Aij\  >  ^  \Aij  \ .  This  assumption  implies  that  the  autocorrelation  of  the  signal  in  any 

channel  is  larger  than  the  sum  of  the  cross-correlations  between  that  channel  and  the  other 
channels.  With  this  assumption,  A  is  invertible.  Let  x  be  a  vector  representing  the  bank  of 
match  filters  outputs  sampled  at  f  =  iTg.  Then  x  can  be  written: 

X  =  AEb  -f  z  (1) 

where  E  is  the  diagonal  matrix,  E  =  diag  . . . ,  ,  and  =  (6i, . . . ,  fejv)  ■  The 

information  symbols  G  {  —  1,1}  are  outcomes  of  identical,  uniformly  distributed,  and  in¬ 
dependent  between  users  random  variables.  is  the  unknown  energy  of  the  received  signal 
from  user  n.  A  time  index  is  not  used  since  decisions  are  based  on  observations  in  a  single  data 
bit  interval.  The  noise  vector  z,  is  assumed  Gaussian  with  correlation  matrix  R(z)  =  cr^G, 
where  G  is  the  cross-correlation  matrix  at  the  output  of  the  match  filters  when  the  input  is 
white  noise  ((G), g,{t)gj{t)dt). 

2.1  Single  User  Detector 

The  conventional  single  user  detector  makes  its  decision  based  on  the  sign  of  the  appro¬ 
priate  element  of  x.  Thus  the  decision  for  the  n-th  user  is  given  by: 

K  =  sgn(x„)  (2) 
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The  n-th  user  error  probability  of  the  single  user  detector  can  be  calculated  as  follows: 


Pn  =  P[xn>  ^\K  =  -1] 

=  P[x„>0|b]P[bl6„  =  -l] 


6„  =  -l 


=  2'""  5:  Q 


be{-i,ir 


^(GX 


where  P  (b  |  6„  =  —1)  =  2^  ^ . 

2.2  Decorrelation  Detector  -  Known  Correlation  Matrix  A 

If  channel  effects  are  ignored  then  A  =  G  is  known.  The  data  vector  b  can  be  re¬ 
covered  from  the  outputs  of  the  bank  of  match  filters  as  follows:  y  =  G  =  G  ^ 
(GEb  +  z)  =  Eb  +  ri.  The  gaussian  noise  vector  t}  is  zero  mean  and  has  covariance  ma¬ 
trix  K{r])  =  G“^R(z)G"^  =  (G~^  =  (G“^)^).  For  the  n-th  user  we  have 

Vn  =  +  Vn,  and  bn  =  Sgn(y„).  Hence,  the  n-th  user  probability  of  error  is  given  by 


In  this  work  we  are  concerned  with  the  design  of  multi-user  detectors  when  the  correlation 
matrix  A  is  not  known.  In  this  context  we  distinguish  between  two  basic  configurations: 
feedforward,  in  which  the  adaptive  filter  operates  on  the  inputs  vector  x,  and  feedback,  in 
which  the  adaptive  filter  operates  on  the  outputs  vector  y.  Both  configurations  are  shown 
schematically  in  Figure  1 

2.3  Adaptive  Detector  -  Feedforward  Configuration 

The  n-th  user  output- is  obtained  by  applying  the  n-th  channel  filter  coefficient  vector 
w„  to  the  input  vector  x.  If  the  weight  matrix  for  the  forward  configuration  is  defined 
W  =  [wi,  W2,  •  •  •  ,  Wjv] ,  the  output  vector  is  given  by  y  =  WAEb  +  Wz.  The  decision 
sheer  output  is  given  by  K  =  sgn  (?/„)  =  sgn(w^x).  For  any  specific  weight  vector  w„,  the 
probability  of  error  for  the  n-th  user  can  be  calculated  as  follows: 


=  P[6n>0|6„  =  -lJ 
=  P  [yn  >  0  I  =  -1] 

=  P  >  0  I  =  -1] 

^nn  ^ 

=  2'-"  E  Q  - 

0-V7W 


6n='-l 
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where  tnn  is  the  n-th  element  of  the  vector  t„  =  wjAE,  and  =  w^Gw„  is  zero-mean 
Gaussian  noise  with  variance 

2.4  Adaptive  Detector  -  Feedback  Configuration 

With  the  feedback  adaptive  multi-user  detector  the  output  ?/„  results  from  subtracting  a 
linear  combination  of  the  other  outputs  vjy„,  from  the  n-th  user  input  ?/„  =  -  vjy„, 

where  v„  denotes  the  A  x  1  feedback  weight  vector.  y„  is  the  vector  of  outputs  obtained 
from  y  by  setting  ?/„  =  0.  The  last  relation  can  be  rewritten  =  2/„  -J-  v^y„  =  vjfy,  where 
Vnn  =  1-  We  define  the  weight  matrix  for  the  feedback  configuration  V  =  [vi,  V2,  •  •  • ,  Va^]^, 
which  relates  the  output  of  the  adaptive  receiver  to  its  input,  x  =  Vy.  If  V  is  invertible  then 
the  output  y  can  be  obtained  as  y  =  V  ^x.  Otherwise,  the  generalized  inverse  can  be  used  to 
compute  y.  The  probability  of  error  can  then  be  found  from  eq.  (5)  using  in  place  of  W. 

III.  Signal  Decorrelation  Criteria 

In  this  section  we  define  four  signal  separation  criteria.  Two  are  based  on  the  minimiza¬ 
tion  of  the  mean  square  error  (MSE)  between  the  channel  output  and  a  reference  supplied 
by  the  detected  output.  The  other  two  criteria  are  for  decorrelation  of  the  outputs. 

3.1  Feedforwrard  Minimum  Error  Detector 


The  Feedforward  Minimum  Error  (FME)  detector  minimizes  the  mean  error  between  its 
output  and  a  reference  signal.  Typically,  the  reference  is  initially  supplied  by  a  training 
signal.  When  the  adaptive  weights  converged  and  the  errors  with  respect  to  the  training 
signal  are  small,  the  detector  is  switched  to  operate  in  decision  directed  mode,  with  the 
reference  signal  supplied  by  the  detected  symbol.  The  FME  detector  realizes  the  optimal 
linear  receiver  for  each  of  the  users.  The  filter  coefficients  may  be  updated  using  the  LMS 
algorithm: 

'^nik  +  1)  —  Wn(k)  ~  fJ-e{k)x{k)  (6) 

where  e  (A:)  =  (A;)  —  (A;)  and  yn{k)  =  wf{k)x[k  —  1).  The  steady  state  solution  of  the 

FME  detector  in  terms  of  the  weight  matrix  is  given  by, 


Wfme  =  arg  imn  E 


R: 


(7) 


is  the  input  correlation  matrix,  and  =  E[xb^l  is  the  cross- 


where  R^  =  E  xx^ 

correlation  matrix  between  the  input  and  the  estimated  symbol.  This  criterion  leads  to 
signal  separation  in  the  noise-free  case  and  to  power-inversion  interference  cancellation  when 
additive  noise  is  present.  In  the  noise-free  case  the  input  correlation  matrix  can  be  written: 


AEE 

AE^A 


bb^ 


EA^ 


(8) 
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where  E^,  is  a  diagonal  matrix  of  the  signal  energies,  and  we  used  E  [bb^]  =  I,  since  the 
signals  are  assumed  independent  between  users. 

The  cross- correlation  matrix  is: 

=  E[AEbb^-]  (g) 

=  pAE 

where  we  used,  p  =  1  -  2Pr  ^  b,]  =  1  -  2Pe,  and  the  result  [4]: 


E  bibj 

. 


1  -  2Pe  i  =  j 

0  j 


(10) 


Then  the  weights  are  given  by  W pjvig  =  pA  ^E  h  In  the  absence  of  noise,  this^linear  trans¬ 
formation  applied  to  the  input  vector,  recovers  the  transmitted  signals,  y  =  =  pb. 


3.2  Feedforward  Zero  Correlation  Detector 


The  Feedforward  Zero  Correlation  Detector  (FZC)  is  different  from  the  FME  detector. 
The  FME  weight  vector  for  any  user  is  controlled  by  the  error  derived  solely  from  the  sheer 
output  corresponding  to  that  user.  In  contrast,  the  n-th  channel  FZC  weight  vector  is 
controlled  by  all  other  sheers’  outputs.  Separation  is  achieved  through  decorrelation  of  the 
n-th  channel  sheer  input  from  all  the  other  sheers’  outputs.  LMS  zero  correlation  algorithms 
have  been  proven  to  have  faster  learning  curves  than  LMS  minimum  error  algorithms  [4]. 
The  LMS  zero  correlation  algorithm  for  the  n-th  channel  is  given  by: 

w„(A:  4- 1)  =  w„(fc)  -  (A:)  b„  (fc)  (11) 


where  b^  =  6i,  ■  ■  • ,  0,  ■  ■ '  is  obtained  from  the  vector  of  estimated  outputs  b,  by  setting 

bn  =  0.  The  input  to  the  n-th  channel  is  passed  with  unity  gain,  Wnn  =  1-  The  algorithm 
converges  in  the  mean  when  E  [j/„b„  =  0,  or  equivalently,  when  E  y„bj  =  OnUn,  where  u„ 
is  a  unit  vector  with  the  n-th  element  set  to  1,  and  is  chosen  such  that  Wnn  —  1-  Using 
matrix  notation,  decorrelation  is  achieved  for  E  [yb^]  =  diag (oi, . . . ,  ajv)  =  D.  In  terms  of 
the  weight  matrix  W,  and  using  eq.  (9),  this  condition  can  be  written: 

E  [yb^l  =  E  [Wp^cl-^Eb -f  z)b'^ 


Wp2C  =  E)(pAE)  ^  (13) 

Indeed,  when  this  weight  matrix  is  applied  to  the  input  x,  each  channel  output  consists  of 
a  scaled  output  of  that  channel’s  user  and  noise:  y  =  (^Eb  +  z)  = 

p'^Db  4-  Wp2C^-  Notice  that  with  the  decorrelation  criterion  it  is  not  necessary  to  assume 
a  noise-free  environment  in  order  to  achieve  separation. 
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3.3  Feedback  Minimum  Error  Detector 

The  Feedback  Minimum  Error  (BME)  detector  minimizes  the  MSE  between  the  input  to 
the  n-th  channel  sheer  and  a  reference  signal  supplied  by  the  sheer  output  It  achieves 
that  by  subtracting  from  a  linear  combination  of  the  other  users’  outputs,  yn-  The 
error  is  given  by  e  (k)  =  (k)  -  |/„  (k) ,  and  the  MSE  is  defined  by  e  =  E  [e^  (k)] .  Using 

y  =  V“^x,  the  BME  weight  matrix  is  given  by: 

2* 

^BME  =  arg  min  E  b-V-^x 
= 

xb 

Using  eqs.  (8)  and  (9)  we  get  for  the  noise-free  case,  VbME  =  (p-'E-‘A-')  (AE^Aq  = 
The  feedback  filter  coefficients  Vg]yjg  achieve  signal  separation:  y  =  = 

ph.  The  n-th  channel  BME  feedback  filter  coefficients  are  updated  using  the  following  LMS 
algorithm: 

Vn{k  +  1)  =  Vnik)  +  fie{k)yn{k)  (14) 

With  the  n-th  element  of  y„  set  to  zero,  the  algorithm  does  not  control  which  is  set  to 
1.  Convergence  in  the  mean  is  achieved  when  the  error  between  the  input  to  the  sheer  and 
its  output  is  decorrelated  from  the  inputs  to  the  other  sheers. 

3.4  Feedback  Zero  Correlation  Detector 


The  Feedback  Zero  Correlation  (BZC)  detector  is  the  feedback  configuration  analogous 
to  the  FZC  detector.  The  updating  of  the  n-th  channel  feedback  weight  vector  is  given  by 
the  following  LMS  decorrelation  algorithm: 


^nik  +  1)  =  Vn{k)  ~  Iiyr,{k)hn{k)  (15) 

where  the  vector  b„  has  been  dehned  previously.  The  algorithm  does  not  control  Vnn,  which 
is  set  to  1.  This  algorithm  converges  in  the  mean  when  E[y„y„]  =  0,  or  equivalently,  when 
E  yb^]  =  diag  (oi, . . . ,  oat)  =  D,  where  the  elements  of  D  are  such  that  Vnn{k)  =  1.  The 
feedback  filter  coefficients  Vg2(q  can  be  found  as  follows: 


E  [yb^ 


=  E  (AEb -f  z)  b^ 

=  pVpypAE 


Hence, 


^BZe  =P^  ^EA^ 


Indeed,  when  this  weight  matrix  is  applied  to  the  input  x,  each  channel  output  consists 
of  the  scaled  corresponding  user  signal  and  noise:  y  =  Vg^^x  =  ^g2C  (^Eb  +  z)  = 

pD  ^b  -f  Similar  to  the  EZC  detector  case,  the  noise-free  assumption  is  not  re¬ 

quired  by  the  BZC  detector  for  signal  separation. 
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IV.  SIMULATIONS  RESULTS 


A  performaiicc  comparisoii  of  ths  operation  of  those  detectors  in  noise  has  been  carried 
out  by  simulations.  The  simulation  model  consisted  of  a  iV  =  4  channel  system,  with  fixed 
cross-correlations  of  0.2.  Probabilities  of  error  were  calculated  after  convergence  of  the  four 
algorithms  in  eqs.  (6),  (11),  (14),  and  (15),  respectively.  Figure  2  shows  the  probability  of 
error  of  the  first  signal  as  a  function  of  the  power  ratio  of  the  second  to  the  first  signal,  with 
the  powers  for  signals  3  and  4  kept  constant,  and  SNR  =  8  dB.  Curves  for  probability  of 
error  for  decorrelation  with  known  cross-correlation  matrix  A  and  single  user  detector  are 
also  shown.  It  can  be  observed  that  the  performance  of  the  adaptive  BME  detector  is  the 
best  among  the  methods  considered.  It  is  actually  slightly  better  even  than  the  decorrelator 
detector  with  known  A,  for  the  whole  range  of  power  of  the  second  user  signal.  The  advan¬ 
tage  of  the  BME  method  is  the  consequence  of  minimizing  an  error,  rather  than  imposing 
the  zero  correlation  condition.  It  is  also  noted  that  the  EME  detector  is  sensitive  to  the 
power  level  of  the  interference,  and  has  the  poorest  performance  among  the  adaptive  detec¬ 
tors  when  the  interference  is  lower  than  the  desired  signal.  Figure  3  shows  the  probability  of 
error  of  the  first  user  as  a  function  of  the  SNR,  when  the  power  ratio  of  the  second  user  to 
the  first  user  is  5  dB.  The  BME  has  the  closest  performance  to  the  known  correlation  matrix 
detector.  The  FZC  and  BZC  detectors  are  slightly  worse. 

V.  CONCLUSIONS 

Four  adaptive  methods  for  signal  separation  are  compared.  Two  methods  are  based  on  a 
feedforward  configuration,  the  feedforward  minimum  error  and  the  feedforward  zero  correla¬ 
tion,  and  two  methods  are  based  on  a  feedback  configuration,  the  feedback  minimum  error 
and  the  feedback  zero  correlation.  The  feedback  minimum  error  is  shown  to  have  the  best 
performance  in  terms  of  probability  of  error. 
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FEEDFORWARD  FEEDBACK 

Figure  1:  Feedforward  and  feedback  configurations  of  the  adaptive  multiuser  detector. 
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Ratio  of  Second  to  First  Signal  (dB) 

Figure  2:  Probability  of  error  in  first  signal  versus  the  ratio  of  the  second  to  first  signal.  The 
methods  compared  are:  FZC  (dashed  line),  BZC  (dash-dot),  FME  (regular  LMS)  (  circles 
on  dots),  and  BME  (continuous  line).  Also  on  the  plot  are  the  lines  indicating  the  single 
user  detector,  and  the  de-c  orrelator  detector  probabilities  of  error  (dotted  lines). 


56 


Figure  3:  Probability  of  error  versus  SNR  for  first  signal.  The  methods  compared  are:  FZC 
(dashed  line),  BZC  (dash-dot),  FME  (regular  LMS)  (  circles  on  dots),  and  BME  (continuous 
line).  Also  on  the  plot  are  the  lines  indicating  the  single  user  detector,  and  the  de-c  orrelator 
detector  probabilities  of  error  (dotted  lines). 
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APPENDIX  B:  PART  I 


ERROR  PERFORMANCE  OF  SYNCHRONOUS  MULTIUSER 
CODE  DIVISION  MULTIPLE  ACCESS  DETECTOR 
WITH  MULTIDIMENSIONAL  ADAPTIVE  CANCELER 

by 

Zoran  Siveski,  Yeheskel  Bar-Ness  and  David  W.  Chen 


ABSTRACT 


A  code  division  multiple  access  detector  that  employs  a  combination  of  a  decorrelator  and 
a  multidimensional  interference  canceler  is  considered.  The  weights  of  the  canceler  are  adap¬ 
tively  controlled  using  a  steepest  descent  algorithm.  The  probability  of  error  is  evaluated 
and  compared  to  those  of  the  decorrelating  detector  and  a  similar,  fixed-weights  scheme  that 
requires  an  estimation  of  received  signal  energies.  It  is  shown  that  the  proposed  two-stage 
detector  provides  substantial  improvement  over  the  decorrelating  detector,  particularly  in 
the  presence  of  strong  interfering  signals.  This  is  especially  noticeable  in  the  high  bandwidth 
efficiency  cases.  It  is  also  observed  that  in  the  presence  of  weak  interferers,  the  proposed 
scheme  performs  better  than  its  fixed-weights  counterpart. 

I.  INTRODUCTION 

A  conventional  single-user  detector  implemented  in  the  Code  Division  Multiple  Access  ( 
CDMA  )  system  consists  of  a  bank  of  filters,  each  one  matched  to  the  signature  sequence  of 
the  particular  user.  The  sampled  output  of  each  matched  filter,  besides  the  desired  signal, 
contains  the  residual  interference  from  all  other  users.  The  presence  of  a  strong  interfer¬ 
ence  often  makes  it  impossible  to  detect  the  weak  user,  a  condition  referred  to  as  a  near-far 
problem.  In  [l]  a  receiver  that  is  optimum  in  the  multiuser  interference  environment  was 
proposed  and  shown  to  eliminate  the  near-far  problem  and  provide  much  improved  perfor¬ 
mance.  The  improvement  comes  at  the  expense  of  high  computational  complexity.  A  class 
of  suboptimum  receivers  that  uses  decorrelating  detectors  and  which  is  based  on  the  linear 
transformation  of  the  sampled  matched  filters’  outputs  was  considered  in  [2]  and  [3].  The 
decorrelating  decision-feedback  detector  presented  in  [4]  utilizes  the  difference  in  received 
users’  energies  where  the  decisions  of  the  stronger  users  are  used  to  eliminate  interference  on 
weaker  users.  Another  approach  for  suboptimum  multi-user  detectors  with  low  complexity 
was  proposed  in  [5]  and  [6],  where  in  order  to  perform  detection  of  the  desired  user,  tentative 
decisions  on  information  bits  of  all  other  users  are  made.  The  estimate  of  the  multiple  access 
interference  is  then  obtained  and  is  subtracted  from  the  desired  signal.  The  performance 
of  some  of  these  suboptimum  schemes  is  close  to  the  performance  of  the  optimum  detector, 
particularly  when  the  power  of  the  interferers  increases,  they  become  indistiguishable.  How¬ 
ever,  these  schemes  have  to  perform  an  estimation  of  the  received  signal  energies,  knowledge 
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of  which  is  required  for  the  detectors’  proper  operation. 

The  detector  for  synchronous  CDMA  systems  proposed  here  consists  of  a  decorrelator  and 
a  nonlinear  multidimensional  interference  canceler  whose  weights  adaptively  adjust  to  the 
incoming  signal.  In  Section  2,  the  system  model  and  the  derivation  of  the  expressions  for  the 
steady  state  values  of  the  weights  and  the  output  error  probability  are  presented.  Numerical 
examples  and  a  discussion  are  presented  in  Section  3. 

II.  DETECTOR:  MODEL  AND  ANALYSIS 

A  synchronous  CDMA  receiver  considered  here  consists  of  a  bank  of  filters  (matched  to 
the  users’  signature  sequences)  front-end,  followed  by  samplers  and  the  decision  system,  as 
depicted  in  Fig.  1. 


Figure  1:  Proposed  receiver  scheme 

The  received  signal  r(t)  is  expressed  as: 

K 

r{t)  =  J2Yl^k{i)V^Sk{t-iT)  +  n{t)  (l) 

k=.l  z 

where  bk{i)  G  {  — 1,+1}  is  the  A;-th  user’s  data  bit  in  the  i-th  time  interval  and  n{t)  is  the 
additive,  zero  mean,  white  Gaussian  noise  with  the  two-sided  power  spectral  density  No/2. 
The  received  energy  of  the  A;-th  user  signal,  unknown  to  the  receiver,  is  denoted  as  a^.  While 
in  reality  it  is  slowly  time  varying,  it  is  assumed  to  remain  unchanged  over  the  transmission 
horizon  of  each  user.  The  signature  sequence  of  the  same  duration  T  as  a  data  bit,  is 

known  to  the  receiver.  The  sampled  outputs  of  the  bank  of  matched  filters  in  the  i-th  bit 
interval  can  be  expressed  as; 

x{i)  =  VAb{i)  -f-  n{i)  (2) 

For  the  sake  of  convenience  the  index  i  will  be  omitted  elsewhere  in  the  text.  In  (2), 
X  =  [xi,X2, . . .  ,xk]'^ ,  b  =  [&1,  62, . . . ,  6a']^,  a  =  diagjyrr,  ^02, . . . ,  ^oa]  is  a  diagonal 
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matrix  and  n  =  [ni,n2, .  •  .^nK^.  The  (fc,i)th  element  of  the  symmetric  crosscorrelation 
matrix  V  is  defined  as: 

T 

pkj  =  I  sk{t)sj{t)dt  k,j  e{l,2,...,K) 

J  0 

with  pkk  =  1-  The  covariance  matrix  of  a  zero  mean  Gaussian  noise  vector  n  is, 


Einn-^}  = 


The  outputs  of  the  decorrelator  are  given  as: 

z  =  V~^x  =  Ab  +  'P~^n  =  Ab  +  rn  =  Ab  +  ^  (3) 


where  jT  is  the  inverse  of  "P,  and  ^  =  Fn.  Let  b  —  61,62, .  • 

. ,  hx  be  the  decision  output 

of  the  decorrelator  defined  as: 

(4) 

b  =  sgn(2) 

Then  the  canceler’s  output  is  given  by: 

y^x-  W^b 

(5) 

where 

0  Wi2  ...  WiK 

W2l  0  ...  W2K 

Wki  WK2  ...  0 

The  output  for  the  k-th  user  can  be  expressed  as: 

Vk  =  Xk-  Wk^bk  (6) 

where  Wk  is  the  A:-th  column  vector  of  W  with  the  element  Wkk  deleted,  and  bk  is  vector 
obtained  from  b  by  deleting  the  element  h-  The  multidimensional  canceler  structure  is  the 
same  as  the  one  termed  in  array  processing  sidelobe  canceler,  except  for  the  fact  that  for 
auxiliary  signals  we  are  using  the  decisions  outputs  of  the  decorrelator.  For  controling  the 
weights  we  use  steepest  descent  algorithm  which  simultaneously  minimizes  the  output  signal 
powers  E{yl].  That  is,  for  the  k-th  output,  the  optimum  weights  are  obtained  by  iterative 
search: 

Wk  ■«—  Wk  —  — E{yl}  =  Wk  —  — E{{xk  —  w^hkY}  =  Wk  +  pE{ykbk}  (7) 

2  owk  2  owk 

Clearly  such  an  algorithm,  by  minimizing  the  output  power,  forces  the  correlation  between 
the  desired  output  signal  and  vector  of  tentative  decisions  of  interfering  signals  to  zero.  In 
practice,  the  expactation  operation  is  replaced  by  time  averaging. 
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The  steady  state  values  can  be  evaluated  as  follows: 


^^{2/^  =  0  -  -^^E{{xk-wrbk)ixk-wlbkf} 

=  ^ — E{xl-2xkbf,Wk  +  wlbkbi.Wk} 

OWk 

=  -2E{xkbk}  +  2E{bkbl}wk  (8) 

From  the  definition  of  6  it  is  obvious  that  E{bibj]  =  0  for  i  ^  j.  Also,  it  should  be  noted 
that  El^'nT'}  =  (iVo/2)/,  where  I  denotes  the  identity  matrix.  Then,  it  is  easily  shown  that: 

E{xkbk]  =  E\^^bkE  plAkbkEnk\bk]  =  AkE{bkb^]  (9) 

where  Ak  is  a  diagonal  (/i  —  1)  x  (/i  —  1)  submatrix  of  A  with  its  k-th  diagonal  entry 
removed,  pj^  is  a  (A"  —  1)  x  1  A;-th  column  vector  obtained  from  V  by  deleting  the  element 
Pkk^  and  bk  is  a  (/F  -  1)  x  1  vector  obtained  from  b  by  deleting  the  element  bk.  Clearly 

A  2^ 

E{bkbi. }  is  diagonal,  therefore  the  system  of  K  -  1  linear  equations  (8),  together  with  (9), 
gives  the  steady  state  values  of  the  weights  affecting  the  k-th  output  as: 

Wk=  E{bkbl}  AkE{bkbl}pj,  (10) 

The  expectations  in  the  above  expression  can  be  computed  as  follows: 

E{bibj}  =  E  {sgn  {y/E',bi  +  (i)  sgn  +  (j)  } 

—  2  ^  <  y/oibi^^j  >  ^/Ejbj}^ 

bi  ,bj 

where  each  probability  term  in  the  above  sumation  defines  four  integrals.  That  is: 


where  denotes  the  bivariate  Gaussian  density  function,  and  Di  an  appropriate  rectan¬ 
gular  region  of  integration.  The  covariance  matrix  of  the  two,  zero  mean,  random  variables 
and  can  be  obtained  as  a  submatrix  of  the  K  x  K  covariance  matrix 


Also: 


where: 


E{^e}  =  E{{ru){rnf}  =  ^r 


E{bibi}  =  1  —  2Pr{bi  in  error}  =  1  —  2(5 


la  No  1 2 


Q{x)  =  r 
s/^  Jx 


(12) 

(13) 
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Let  the  decision  output  of  the  second  stage  6  =  &2,  ■  •  • ,  be  defined  as: 

S  =  sgn{y)  (14) 

The  output  error  probability  for  fc-th  user  is  evaluated  as  follows: 

Pe.  -  E  ,  i  Pr{bk  in  evvox\bk,bk,bk} 

bk,bk,bk 

-  i  >  VEk  -  b^AkPk  +  '^Ibk} 

bktbk 

+  Pr{nk  <  —\/Ek  —  b^AkPk  +  P'>'{bk\bk}P'f'{bk} 

Since  Pr{bk}  —  and  Uk  is  a  zero  mean  Gaussian  random  variable  we  can  write: 

Pek  =  2'^'  Y.  [Pr{nk>  VE^k-blAkPk  +  wlk} 

bkhk 

+  Pr{nk>  ^/al:^blAkPk-wlbk]\Pr{bk\bk]  (15) 


It  can  be  shown  (see  Appendix)  that  Pr{bk\bk}  =  Pr{—bk\  —  bk},  and  therefore: 

Pek=2-^^-^^  E  [Pr{nk>V^k-blAkPk  +  wrbk}]Pr{bk\bk]  (16) 

bkhk 

Finally  Substituting  (10)  into  (16)  and  recognizing  that  Ak  is  diagonal,  we  obtain  for  the 
k-th.  user  error  probability: 


Pek=‘^ 


E  Q 

bkhk 


E{bkbl} 


E{bkbk}]  Akp 


No/2 


Pr{bk\bk}  (17) 


Pr{bk\bk}  is  the  integral  of  the  [K  —  l)-variate  Gaussian  density  function 


Pr{bk\bk} 


where  is  a  vector  obtained  from  ^  by  deleting  the  element  ^k  and  ^k  is  the  {K 
1)  X  {K  —  1)  covariance  matrix  of  ^k  obtained  from  (12)  by  eliminating  the  A;-th  row  and 

column.  Introducing  Fk  —  {Nol2)~^3k,  and  Rk  =  diag  a/S7V^,  ...  , 
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oo 

vsm  ‘ 

Pr{bk\b,}  =  ■■■  exp  i-XlRkri^RkXk)  dX^ 

J  J  ^  I 

6;Ai  =  -6;b; 

i—l,...,K  i^k 

The  probability  of  error  can  be  expressed  as: 

r..  =  5:  Q  USNRt  -  (bl  -  bl  E{hbl})  fitpd]  Pr{K\h} 

Kk  L  \  V  /  /J 

(18) 

III.  NUMERICAL  EXAMPLES  AND  DISCUSSION 


In  this  section,  two  sets  of  numerical  examples  for  the  bit  error  probabilities  of  the  proposed 
adaptive  detector  are  calculated  and  presented.  For  comparison  purposes,  the  error  perfor¬ 
mances  of  the  corresponding  fixed-weights  detector  of  Ref.  [6]  (in  which  the  weights  are  set, 
based  on  the  knowledge  of  the  received  signals’  energies,  to  wji  =  pijy/d^),  and  those  of  the 
decorrelating  detector  are  included. 

The  first  example  depicted  in  Fig.  2  is  a  simple,  two-user  case,  but  it  nevertheless  provides 
some  insight  into  the  steady  state  behavior  of  the  adaptive  detector.  The  crosscorrelation 
coefficient  taken  to  be  0.7,  can  certainly  be  considered  as  representing  a  high  bandwidth 
efficiency  case.  The  SNR  of  user  1  is  set  to  8  dB,  while  the  SNR  of  user  2,  relative  to  user 
1,  varies  from  -10  to  8  dB.  From  this  figure  we  notice  first  that  when  the  interferer  (user  2)  is 
strong,  user  1  achieves  the  performance  of  the  single  user  system.  Also  noticeable  is  the  fact 
that  the  adaptive  system  performs  the  same  as  the  system  with  fixed  weights  [6].  On  the 
other  hand,  with  a  weak  interferer,  and  consequently,  poor  tentative  decision  62,  the  adaptive 
system  slightly  outperforms  the  corresponding  fixed-weights  scheme.  The  improvement  can 
be  explained  by  comparing  their  respective  amounts  of  residual  interference.  In  the  former, 
from  (6),  it  is  equal  to  Pl2^/a2  [62  —  Fl{62^2}^2  ,  which  is  certainly  less  than  in  the  latter 

case  of  Pi2a/o2  ^2  —  ^2  ■  For  almost  all  SNR’s  the  performance  of  the  decorrelating  detector 
compares  unfavorably  to  the  other  two. 

In  the  second  set  of  examples.  Gold  sequences  of  length  seven  (frequently  used  in  the  liter¬ 
ature,  e.g.,  [4]  and  [6])  are  chosen  for  signature  waveforms.  The  rationale  for  such  a  choice 
is  that  Gold  sequences  are  regularly  used  in  an  asynchronous  CDMA  environment  and  the 
study  of  its  proposed  synchronous  counterpart  may  provide  a  useful  indication  of  the  per¬ 
formance  of  the  former.  The  crosscorrelation  matrix  R  in  this  case  is: 

■7-1  3  3  3  ■ 

^  -1  7-1  3-1 

V  =  -  3-1  7-1-1 

3  3-17-1 

3  -1  -1  -1  7 
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The  two-user  case  with  Gold  sequences  is  shown  in  Fig.  3.  As  expected,  in  this  low  band¬ 
width  efficiency  scenario  (pi2  =  -1/7)  the  decorrelating  detector  performs  as  well  as  the 
other  two  schemes.  By  adding  an  additional  user,  Fig.  4,  the  decorrelating  detector  begins 
to  exibit  its  inadequacy.  The  adaptive  and  the  fixed-weights  scheme  show  virtually  identi¬ 
cal  performance,  with  the  former  being  only  slightly  better  for  weak  interferers.  With  the 
number  of  simultaneous  users  increasing  further  to  K=4  and  K=5,  as  in  Figs.  5  and  6,  re¬ 
spectively  (note  the  change  of  the  vertical  axis  scaling),  certain  trends  become  more  obvious. 
Due  to  its  unacceptable  high  probability  of  error,  the  decorrelating  detector  clearly  does  not 
represent  an  appropriate  choice.  When  the  interferers  are  strong,  both  the  adaptive  and  the 
fixed-weights  scheme  achieve  the  performance  of  the  single  user.  The  former  provides  better 
error  performance  with  weak  interferers. 

Finally,  Fig.  7  depicts  the  error  probabilities  in  the  case  when  all  users  maintain  the  same 
SNR’s.  It  shows  the  extent  of  the  performance  deterioration  of  the  system  as  the  number  of 
simultaneous  users  increases. 

IV.  CONCLUSION 

The  adaptive  code  division  multiple  access  detector  proposed  here  was  shown  to  provide 
significantly  better  steady  state  error  performance  than  the  decorrelating  detector.  This  is 
especially  true  under  the  most  critical  conditions  for  the  multiuser  environment,  such  as 
near-far  situations  and  high  bandwidth  efficiency  utilization.  In  the  presence  of  strong  in¬ 
terference  the  proposed  detector  achieves  the  performance  of  the  optimum  detector.  When 
interfering  signals  are  weak,  it  performs  slightly  better  than  its  non-adaptive  counterpart 
(one  that  requires  prior  estimation  of  the  received  signal  energies),  but  without  requiring 
knowledge  of  the  received  signal  energies.  This  paper  deals  with  the  synchronous  case, 
which  is  not  aplicable  to  base  station  processing.  The  asynchronous  case,  as  an  extension  to 
the  synchronous  case  approach,  is  currently  being  pursued  by  the  authors. 

V.  APPENDIX 

By  definition: 

Pr{bk\bk}  = 


Pr{sgn{^/a^bl  +  ^i), . . .  ,sgn{^ak-ibk-i  +  6-i)) 
sgn{y/ak+ibk+i  +  6+i)?  ■  • .  ,sgn{^/a^'bK  +  Ck)} 
Pr{bi^i  >  . . . ,  >  -^/ak-ibk-ih-i, 

bk+iCk+l  >  —y/oP^bk+ibk+ii  .  .  •  ,  bK^K  >  — 

Pr{bi^i  <  ^/^blbl, . . .  ,bk--i^k-i  < 

bk+i^k+i  <  y/dk+ibk+ibk+i,.  ■  ■ ,  bxiK  <  y/^bKbK} 

Pr{~bk\  -bk} 
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APPENDIX  B:  PART  II 


SYNCHRONOUS  MULTIUSER  CDMA  DETECTOR 
WITH  SOFT  DECISON  ADAPTIVE  CANCELER 


by 

David  W.  Chen,  Zoran  Siveski  and  Yeheskel  Bar-Ness 

ABSTRACT 


A  two-stage  multiuser  detector  for  the  synchronous  code  division  multiple  access  (CDMA), 
additive  Gaussian  noise  channel  is  considered.  It  employs  a  combination  of  a  decorrelator 
and  a  nonlinear  multiuser  interference  canceler  that  utilizes  soft  tentative  decisions.  The 
weights  of  the  canceler  are  adaptively  adjusted,  thus  rendering  the  knowledge  of  the  received 
signal  energies  unnecessary.  The  steady  state  error  performance  of  the  detector  is  obtained 
and  found  to  be  superior  to  the  performance  of  the  same  detector  that  utilizes  hard  tentative 
decisions. 

I.  INTRODUCTION 

A  synchronous  CDMA  receiver  that  employs  a  combination  of  a  decorrelator  and  a  multiuser 
interference  canceler  and  whose  weights  are  adaptively  controlled  was  presented  in  [1].  The 
scheme  did  not  require  a  prior  estimate  of  received  signal  energies,  as  was  the  case  with 
its  non-adaptive,  fixed  weight  counterpart  described  in  [2].  The  tentative  data  bit  estimates 
used  in  the  canceler  were  obtained  as  hard  decisions  on  decorrelator  outputs.  In  the  presence 
of  strong  multiuser  interference,  in  the  steady  state,  the  adaptive  scheme  approached  the 
performance  of  an  optimum  receiver,  while  in  the  presence  of  weak  interference  it  performed 
better  than  [2].  In  this  paper,  soft  tentative  decisions,  in  the  manner  described  in  [3,4]  are 
used  instead,  and  the  steady  state  error  performance  of  the  detector  is  evaluated. 

IL  DETECTOR:  MODEL  AND  ANALYSIS 

The  detector  considered  here  is  shown  in  Fig.  1.  The  received  signal  r{t)  is  expressed  as: 

K 

r(t)  =  XI  m  -  *T)  +  n(f), 

i 

where  bk{i)  G  {  —  1,  +1}  is  the  fc-th  user’s  data  bit  in  the  Tth  time  interval  and  n{t)  is  the 
additive,  zero  mean,  white  Gaussian  noise  with  the  two-sided  power  spectral  density  No/2. 
The  received  energy  of  the  fc-th  user  signal,  unknown  to  the  receiver,  is  denoted  as  ak-  The 
signature  sequence  Sk{t).,  of  the  same  duration  T  as  a  data  bit,  is  known  to  the  receiver.  The 
sampled  outputs  of  the  bank  of  matched  filters  in  the  z-th  bit  interval  can  be  expressed  as: 

X  =  VAb  -f  n. 
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where:  x  =  [a^i,  X2, . .  • ,  xkY ,  b  =  [bi,  62, ,  6^]^, 
n=  [ni,  n2, . . . ,  uk]'^  and  A  =  diag[y/a^,  . . . ,  ^/aK]. 

The  (A;,  j)th  element  of  the  symmetric  crosscorrelation  matrix  V  is  defined  as: 

fT 

Pkj  / 

Jo 

with  pkk  =  1-  The  outputs  of  the  decorrelator  are  given  as: 

z  =  V~^x  =  Ab  +  V~^n  =  Ab  + 
where  z  =  [zi,  Z2,  •  ■  ■ ,  ^A']^  and  ^  =  [6, 6,  ■  •  ■ , 


The  output  for  the  A:-th  user  can  be  expressed  as: 

K 

Vk  =  Xk -wjhk  =  Xk -Y^wikhik,  (1) 

1  =  1 
l^k 

where  Wk  'is  the  fc-th  column  vector  of  a  A"  x  K  weight  matrix  W,  with  the  element  Wkk  =  0 
deleted,  and  hk  is  the  A;-th  column  vector  of  a  A  x  K  matrix  ff,  with  the  element  hkk  =  0 
deleted.  The  (/,  A;)th  element  of  the  matrix  H  represents  the  output  of  the  soft  limiter  to 
the  input  z;,  and,  for  l,k  =  1,. . .  ,K  I  ^  k,  is  defined  as: 


l,  ^ll^lk  1-2/1  <  ^Ik 

^  I  sgn{zi)  otherwise. 

The  limiter’s  parameter  Xik  is  determined  heuristically  from  the  observed  values  of  the  decor- 
relator  outputs  as: 

=  z{|.,|}  ’ 

where  the  above  expectation  is  evaluated  as: 

Z{kl}  =  v^(l-2Q(^))+^e-^. 


E{\zk\}  =  k/rk  1-2(5 


where: 


and  cr?  is  the  variance  of  ^k- 


For  controlling  the  weights,  the  steepest  descent  algorithm,  which  simultaneously  minimizes 
the  output  signal  powers  E{yl},  is  used.  That  is,  for  the  k-iii  output,  the  optimum  weights 
are  obtained  by  the  iterative  search 


»4.  +  i)  =  -‘W  - 

=  {I  -  iiE{hkhl])wk{i)  +  pE{xkhk}. 


(In  practice,  the  expectation  operation  is  replaced  by  time  averaging). 

The  steady  state  values  can  be  evaluated  from: 

- — E{yl}  =  0  = -E{xkhk}  +  E{hkhl}wk.  (4) 

dwk 

It  is  easily  shown  that: 

E  }■  ^  "h  P]^  -^k^k  "h  '^k  ^k  ^ 

=  [Afc£{6fcfefc}]^Pfc, 

where  Ak  is  a  diagonal  {K  —  1)  x  (K  —  1)  submatrix  of  A  with  its  fc-th  diagonal  entry 
removed,  is  a  {K  —  1)  x  1  A;-th  column  vector  obtained  from  'P  by  deleting  the  element 
pkki  and  bk  is  a  (K  —  1)  x  1  vector  obtained  from  b  by  deleting  the  element  bk-  Clearly 
E{bkhl}  is  diagonal;  therefore,  the  system  of  —  1  linear  equations  (4)  gives  the  steady 
state  values  of  the  weights  affecting  the  A;-th  output  as: 

Wk  =  [E{hkhl}]  ^  AkE{bkhl}Pk.  (5) 

The  expectations  in  the  above  expression  are  given  in  the  Appendix. 


Defining  the  A:-th  user  final  decision  output  as  bk  =  sgn{yk),  its  probability  of  error  is 
evaluated  as  follows: 

Pet,  —  ^ 

=  ^^YlPh,P^{-VP~k  +  blAkPk 

bk 

-w^hk  +  nk>  0|A;t}. 

Introducing  the  transformed  Gaussian  random  variable 


K 

^  nk-Yh  Ci 


i=l 

,^k 


P  = _ 

e*  2K-1 

Ph^,P^{'^k  >  ^/^k  -  blAkpk  +  wlgf^\hk], 

bk 
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where 


9k  =  [gn,  92k,  ■  ■  ■ ,  9k-i,k, gk+i,k,  ■  ■  ■ ,  gxf  with 

_  /  \/^bi(Xik  \zi\  <  \ik  ■  _  K  i  ^  h 

1  sgn[zi)  otherwise 


Defining  the  vector  C, 

C  ~  [^1)  ^2)  •  •  •  )  Ck—l,  ^A:+l  ■,■••,  ^-K]  1 

the  hnal  expression  for  the  error  probability  is  obtained  as: 


2/v — i 

E  Jo 

h  n=l 


where  is  a  /f"-variate  Gaussian  density  function,  £)„  is  a  hyper  cube  defined  by:  ^/aiJ 

hlAkPk  +  <  Ck  and  for  f  =  1, 2, . . . ,  A",  zV 

Xik  Ci  Xik  \^i\  ^  Xik 

Ci  >  Xik  -  y/oTib,,  Ci  <  -Xik  -  y/^b,  otherwise. 

III.  NUMERICAL  EXAMPLES  AND  DISCUSSION 


The  steady  state  error  probabilities  are  presented  for  the  two  and  three-user  cases.  The 
signal  to  noise  ratio  for  user  k  (SNRk)  is  defined  as  akJNo-  In  Fig.  2,  SNRi  is  set  to  8  dB 
while  SNR2/SNR1  varies  from  -10  to  8  dB.  The  value  of  the  crosscorrelation  coefficient 
pi2  =  0.7,  which  represents  a  high  bandwidth  efficiency  case.  Compared  to  the  error  per¬ 
formance  of  the  detector  from  [1],  which  utilizes  hard  decisions,  and  the  performance  of  the 
decorrelator,  a  significant  improvement  has  been  obtained. 

In  the  next  two  examples,  Gold  codes  of  length  seven  (e.g.,  [2]  and  [1])  were  used  for  signa¬ 
ture  waveforms.  For  the  two-user  case,  in  a  low  bandwidth  efficiency  case  with  pi2  =  —1/7 
(Fig.  3)  as  expected,  a  performance  of  the  decorrelator  is  very  close  to  the  single  user  bound. 
Therefore,  a  negligible  improvement  is  obtained  with  either  the  hard  or  soft  tentative  deci¬ 
sions.  The  three-user  case  with  pis  =  3/7  and  P23  =  -1/7  is  depicted  in  Fig.  4.  Here,  the 
two-stage  detector  with  a  soft  limiter  clearly  shows  the  best  performance.  For  a  different 
set  of  crosscorrelation  coefficient  values  {pu  =  0.5,  pis  =  0.5,  and  P23  =  0.2)  in  Fig.  5, 
the  difference  in  the  performance  between  the  decorrelator  and  the  two-stage  detector  is 
larger,  with  the  soft  limiter  again  being  a  better  one.  Finally,  Fig.  6  depicts  the  error  proba¬ 
bilities  in  the  case  when  all  the  users  maintain  the  same  SNR's  and  the  Gold  codes  are  used. 


IV.  APPENDIX 


Let  f^,  be  the  density  function  of  Zi,  and  the  joint  density  function  of  Zi  and  zj.  Then: 


E{hlk}  = 


fzidzi  + 
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I 


+  j  Jz  ^^^9n{zj)f,^,Jzidzj 

+  J  Jz  ^^^9n{zi)f,^^dzidzj 

-  Uz.  sgn{z,)sgn{zj)f^^,Jz,dzj 


E{bihik}  = 

+  Jj^  bisgn{zi)f^^dzi  , 


where  Li,  i  =  1,2  correspond  to  the  appropriate  intervals  of  Zi,  and  Z,,  i 
spond  to  the  appropriate  rectangular  regions  in  the  {zi,Zj)  plane. 


. ,  4  corre- 
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Figure  1:  Figure  1:  Proposed  detector  scheme 
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Figure  2:  Error  probability  of  user  1  for  K  =  2,  and  SNRi  =  8  dB  and  pi2 
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Figure  3:  Figure  5:  Error  probability  of  user  1  for  K  =  3 
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Figure  4:  Figure  6:  Error  probability  of  user  1  for  K  =  2  and  K  =  3 
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BOOTSTRAPPED  DECORRELATING  ALGORITHM  FOR 
ADAPTIVE  INTERFERENCE  CANCELATION  IN 
SYNCHRONOUS  CDMA  COMMUNICATIONS  SYSTEMS 


by 

Yeheskel  Bar-Ness,  Zoran  Siveski  and  David  W.  Chen 


ABSTRACT 


A  code  division  multiple  access  detector  that  employs  a  combination  of  a  correlation  detector 
and  a  multiuser  adaptive  interference  canceler  was  previously  proposed  by  the  authors.  The 
weights  of  the  canceler  are  adaptively  controlled  by  the  steepest  descent  algorithm  using  the 
minimum  power  criterion  of  the  canceler  outputs.  It  was  shown  that  such  a  canceler  sub¬ 
stantially  outperforms  the  decorrelating  detector  and  has  almost  the  same  error  probability 
as  a  canceler  that  uses  reliable  estimates  of  the  received  signal  energies.  In  this  paper,  a 
different  weight  control  criterion  based  on  minimizing  the  correlation  between  the  signals 
at  the  outputs  of  the  canceler  is  used,  and  its  performance  is  compared  to  that  obtained 
with  the  minimum  power  criterion.  It  is  shown  that  both  cancelers  have  almost  the  same 
performance,  particularly  when  SNR  is  modestly  high.  Although  it  is  not  shown  in  this 
paper,  we  believe  that  minimizing  correlation  rather  than  power  outputs  is  a  more  suitable 
criterion  when  dealing  with  highly  dispersive  channels. 

I.  INTRODUCTION 

A  conventional  single  user  detector  implemented  in  the  Code  Division  Multiple  Access 
(CDMA)  system  consists  of  a  bank  of  filters,  each  one  matched  to  the  signature  sequence  of 
the  particular  user.  The  sampled  output  of  each  matched  filter,  besides  the  desired  signal, 
contains  the  residual  interference  from  all  other  users.  In  particular,  for  mobile  and  wireless 
personal  communication  the  choice  of  CDMA  is  attractive  because  of  its  potential  capacity 
increase  and  other  technical  factors,  such  as  anti-multipath  fading  capability.  However,  for 
satisfactory  performance,  one  must  consider  the  effect  of  the  “near-far  problem.  So  far, 
practical  systems  have  used  power  control  to  combat  this  problem.  Commercial  digital  cel¬ 
lular  systems  based  on  CDMA,  and  which  use  stringent  power  control  are  described  in  [1]. 
They  offer  capacity  increases  over  TDMA.  A  different  approach  for  combating  the  near-far 
problem  was  suggested  by  Verdu  [2],  where,  an  optimum  receiver  in  the  multiuser  interfer¬ 
ence  environment  was  proposed  and  shown  to  eliminate  the  near-far  problem  and  provide 
much  improved  performance.  The  improvement  comes  at  the  expense  of  high  computational 
complexity.  A  class  of  suboptimum  receivers  that  uses  decorrelating  detectors  and  which 
is  based  on  the  linear  transformation  of  the  sampled  matched  filters’  outputs  was  consid¬ 
ered  in  [3]  and  [4].  The  decorrelating  decision-feedback  detector  presented  in  [5]  utilizes  the 
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difference  in  received  users’  energies  where  the  decisions  of  the  stronger  users  are  used  to 
eliminate  interference  on  weaker  users. 

Another  approach  for  suboptimum  multiuser  detectors  with  low  complexity  was  proposed  in- 
[6]  and  [7],  The  performance  of  some  of  these  suboptimum  schemes  is  close  to  the  performance 
of  the  optimum  detector.  Particularly  when  the  power  of  the  interferes  increase,  they  become 
indistinguishable.  However,  these  schemes  have  to  perform  an  estimation  of  the  received 
signal  energies,  knowledge  of  which  is  required  for  the  detector’s  proper  operation. 

Still  another  approach  for  improving  the  performance  of  CDMA  systems  in  multiuser  envi¬ 
ronments  is  implementing  adaptive  algorithms  for  signal  separation  [8,  9]. 

In  contrast  to  conventional  interference  cancelers,  wherein  the  interference  to  desired  signal 
ratio  is  high,  at  the  output  of  the  matched  filter  of  a  Direct  Sequence  Code  Division  Multiple 
Access  (DS/CDMA),  the  desired  signal  is  higher  and  the  designer’s  aim  is  to  further  reduce 
the  interference.  Unless  special  arrangements  are  made,  conventional  cancelers  are  useless 
for  removing  interference  in  CDMA  systems.  Such  an  arrangement  includes,  for  example, 
the  use  of  a  reference  in  the  form  of  an  estimate  of  the  desired  signal,  which  can  be  obtained 
by  a  training  sequence,  etc.  Clearly,  such  an  approach  is  difficult,  particularly  in  vehicular 
communication.  Variable  desired  signal-to-interference  ratios  resulting  from  the  “near-far” 
problem  make  the  difficulty  even  worse.  Instead,  we  used  in  [9]  and  in  this  paper,  the 
tentative  decision  on  the  data  following  the  decorrelation  transformation  (see  Figure  1). 
When  dealing  with  an  adaptive  scheme,  the  questions  are  what  kind  of  performance  index 
and  which  algorithm  to  use.  While  the  minimum  power  criterion  was  implemented  in  [9], 
here  we  propose  to  use  a  special  form  of  decorrelating  algorithm  frequently  used  in  neural 
networks  and  other  applications  of  signal  separation  [10,11]. 

In  Section  2,  the  system  model  of  the  adaptive  canceler  will  be  described  and  the  decorre¬ 
lating  algorithm  will  be  presented.  An  analytical  expression  for  the  equations  from  which 
the  steady  state  mean  value  of  the  controlled  weight  will  be  given  and  simplified  using  some 
approximation  particularly  valid  for  reasonably  high  SNR’s.  The  output  error  probability  is 
calculated  in  Section  3  (using  the  aforementioned  approximation),  then  exact  computation 
of  this  probability  for  the  case  of  two  users  is  given  in  Section  4.  Section  5  contains  compu¬ 
tational  and  simulation  results  for  different  signal’s  scenarios,  after  which  some  conclusions 
are  drawn. 

II.  THE  ADAPTIVE  CANCELER:  MODEL  AND  ANALYSIS 

A  synchronous  CDMA  receiver  considered  here  consists  of  a  bank  of  filters  (matched  to 
the  users’  signature  sequences)  front-end,  followed  by  samplers  and  the  decision  system,  as 
depicted  in  Fig.  1. 

The  received  signal  r(t)  is  expressed  as: 

K 

r{t)  =  Y^Y^hk{i)^/^Sk{t -iT) +  n{t)  (1) 

A:=l  i 

where  hk{i)  €  {  — l,+l}  is  the  A;-th  user’s  data  bit  in  the  Fth  time  interval  and  n{t)  is  the 
additive,  zero  mean,  white  Gaussian  noise  with  the  two-sided  power  spectral  density  Nof2. 
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The  received  energy  of  the  k-th.  user  signal,  unknown  to  the  receiver,  is  denoted  as  ak-  The 
signature  sequence  of  the  same  duration  T  as  a  data  bit,  is  known  to  the  receiver.  The 

sampled  outputs  of  the  bank  of  matched  filters  in  the  i-th  bit  interval  can  be  expressed  as: 

x{i)  =  VAb{i)  +  n{i)  (2) 

For  the  sake  of  convenience,  the  index  i  will  be  omitted  elsewhere  in  the  text.  In  (2), 

x  =  [xi,X2,...  ,XKf,  b  =  [fei,  62, .  -  - ,  bK]:  A  =  diag[^,  . . . ,  is  a  diagonal  matrix 

and  n  =  [ni,n2,  • . .  The  {k,j)th  element  of  the  symmetric  cross-correlation  matrix 

"P  is  defined  as:  ^ 

pkj  =  I  Sk{t)sj{t)dt  A;,;  e  (l,2,...,/\) 

Jo 

with  pkk  =  1-  The  covariance  matrix  of  a  zero  mean  Gaussian  noise  vector  n  is, 

E{nn^)  =  y-P 

The  outputs  of  the  decorrelator  are  given  as: 

z  =  V-^x^  Ab  +  V-^n^  Ab  +  rn  =  Ab  +  i  (3) 

where  F  is  the  inverse  of  "P,  and  ^  =  Fn.  Let  b  =  &i,  62, . . . ,  be  the  decision  output 

of  the  decorrelator  defined  as: 

6  =  sgn(2)  (4) 

Then  the  canceler’s  output  is  given  by: 

y  =  X-  W^i)  (5) 

where 

■  0  u;i2  . . .  WiK 

W21  0  ...  W2K 

W=  .... 

WKl  WK2  •  •  •  0 

The  output  for  the  k-th.  user  can  be  expressed  as: 

Vk^  Xk-  Wk^bk  (6) 

where  Wk  is  the  A:-th  column  vector  of  W  with  the  element  Wkk  deleted,  and  bk  is  the  vector 
obtained  from  b  by  deleting  the  element  bk-  The  multidimensional  canceler  structure  is  the 
same  as  the  one  termed  in  an  array  processing  sidelobe  canceler,  except  for  the  fact  that  for 
auxiliary  signals  we  are  using  the  decision  outputs  of  the  decorrelator. 

For  controlling  the  weights,  we  use  the  steepest  descent  algorithm,  which  simultaneously 

reduces  the  absolute  value  of  the  correlation  between  the  data  output  bm  of  channel  m  and 
all  other  outputs  of  the  canceler  (before  decision).  That  is  the  weight  Wmi  is  controlled  by 
the  recursion 
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Wrai  Wmi  -  ^E{yi?,gn{yrn)}  forl</,m<A"  l^m  (7) 

To  see  the  logic  behind  this  recursion  for  controlling  the  weight  we  refer  to  Fig.  2,  which 
is  the  special  case  of  three  users.  Recursion  (7)  reaches  steady  state  in  the  mean  when 
E{yisgn{ym)}  =  0.  Notice  that  w^i,  for  example,  is  used  to  cancel  residue  of  data  62  at  the 

output  yi-  It  will  settle  down  only  when  that  residue  (being  correlated  with  b)  is  zero.  But 

any  reduction  of  62  at  yi  will  improve  61  (smaller  error)  and,  hence,  will  be  more  effective  in 
reducing  the  residue  of  bi  at  ys  and  y2  through  the  control  weights  u;i2  and  rcis.  Therefore,  the 
process  of  residue  cancelation  is  enhanced  successively,  which  justifies  the  name  “bootstrap.” 
In  vector  form,  by  using  the  notation  of  Eq.  6,  we  write: 

Wk  ^  Wk- yE{ykSgn{yi^)]  (8) 

where  again  y^.  is  obtained  from  the  column  vector  y  by  deleting  the  element  yk. 

The  steady  state  would  be  reached  if 

E{ykSgn{yk)}  =  0,  for  A;  =  1, 2, ... ,  K 

This  means  that  K{K  —  1)  nonlinear  equations  have  to  be  evaluated.  (An  example  of  K  =  2 
for  finding  the  steady  state  values  of  W21  and  W12  will  be  given  in  the  next  section). 

In  the  general  case,  it  will  be  assumed  that  the  signal-to-noise  ratios  (SNR’s)  are  large  enough 
such  that  the  main  contribution  to  the  output  error  is  the  multiuser  interference. 

Therefore,  E{ykSgn.{yi.)}  =  E{ykbk}  can  be  approximated  as: 

E{ykbk}  ~  E{ykbk}{l -2Pr{bk  m  evTOT)) 

-  E{ykbk}{l-2P,,)  (9) 

Substituting  (6)  into  (9)  with  Xk  taken  from  (2),  we  get: 

E{ykbk}  ~  E  \^(^y/aiibk  + plAkbk  +  nk-wlbk)bk] 

•(l-2Pe.  ), 

and  the  algorithm  will  result  in  having: 

y/a^E{bkbk}  +  E  {(^plAkbk)  6/,} 

+E  {ukbk}  -  E  [(wlbk)  bk]  =  0 

It  is  easy  to  show  that  E  {bkbk}  =  0,  E  \^(^plAkbk)  bk}  =  Alp,.,  E  {nkbk}  =  0,  and 
E  {  (wlbk'j  bk]  =  BkWk,  where  Bk  =  diag  [A  j  =  1,  2, . . . ,  /F  j  ^  k.  Therefore, 

Wk  =  B^^Akpk  (10) 

where  we  used  the  fact  that  Ak  is  diagonal. 
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III.  OUTPUT  ERROR  PROBABILITY 


The  output  error  probability  for  the  k-th  user  is  evaluated  as  follows: 

Pek  =  P.  h  i  Pr{bk  in  eTroT\bk,bk,bk} 

*  bk,Ok,Ok 

=  ^  X]  [Pr{nk>  y^-blAkpk  +  wlbk} 
bk,bk 

+  Pr{nk  < -y/a^  -  b^AkPk  +  wlbk}  ■ 

Pr{bk\bk}Pr{bk} 

Since  Pr{bk}  =  and  rik  is  a  zero  mean  Gaussian  random  variable  we  can  write: 

Pek  =  2“^  Yj  [P'^i'^k  >  y/ok  -  blAkPk  +  wlbk} 

bkhk 

+  Pr{nk  >  +  blAkPk  -  ^Ih]]  Pr{bk\bk}  (11) 

Also 

Pr{bk\bk} 

=  Pr{sgn{^/^bl  A  . . .  ,sgn{^ak-ibk-i  +6-i)? 

sgn{^yak+ibk+i  +  tk+i),  ■  •  •  ,sgn(v^6A'  +  ^k)} 

=  Pr{6i<fi  >  —^/^blb■^, . . . ,  h-i^k-i  > 

—  y/ak-ibk-ibk-i,  bk+iCk+i  >  —\/nk+ibk+ibk+i, 

•  ■  ■  ,  bK^K  >  — \/^^A'^A'} 

=  Pr{bi^i  <  ^/albibi, . . . ,  bk-iik-\  < 

bk+i^k+i  <  ^/dk+ibk+ibk+ii  •  •  •  5  bfc^K  <  \/^^A'^A'} 

=  Pr{-6,l-6,}  (12) 

Applying  the  result  of  (12)  to  the  second  term  in  (11)  enables  splitting  the  latter  into  two 
equal  terms.  Therefore 

Pek  =  5  [Pr{nk>  ^/^-blAkPk  +  wlbk} 

bk 

Pr{bk\bk}  (13) 
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where  Pr{bk\bk}  is  the  integral  of  the  [K  —  1)  variate  Gaussian  density  function. 

The  Two-user  Case 


For  this  case,  one  can  solve  for  the  optimal  weight  without  the  need  for  the  approximation  of 
(9),  although  the  resulting  two  simultaneous  equations  in  u;2i,  Wu  are  highly  nonlinear  and 
require  numerical  method  for  its  solution.  The  calculated  weight  can  then  be  substituted  in 
the  following  equation  to  obtain  the  error  probability; 


where, 


IV.  COMPUTATIONAL  AND  SIMULATION  RESULTS 

Two  sets  of  examples  are  used  to  examine  the  performance  of  the  proposed  canceler.  In  the 
first  set,  we  consider  the  two-user  case  with  cross  correlation  coefficient  pi2  =  0.7.  It  can 
certainly  represent  a  high  bandwidth  efficiency  case.  Here  we  hnd  by  computation  the  error 
probability  as  a  function  of  the  SNR’s  difference  of  the  two  users  while  the  SNR  of  user  1 
is  kept  constant  at  8  dB  and  12  dB.  This  is  shown  in  Fig.  3  and  Fig.  4,  respectively.  In 
the  figures,  we  added  the  results  of  simulation,  those  obtained  by  using  the  approximation 
introduced  in  Eq.(9),  and  those  resulting  from  the  decorrelating  detector  of  Verdu  [3]. 

In  the  second  set  of  examples.  Gold  sequences  of  length  seven  (frequently  used  in  the  liter¬ 
ature,  e.g.,  [4]  and  [6])  are  chosen  for  signature  waveforms.  The  rationale  for  such  a  choice 
is  that  Gold  sequences  are  regularly  used  in  an  asynchronous  CDMA  environment  and  the 
study  of  its  proposed  synchronous  counterpart  may  provide  a  useful  indication  of  the  per¬ 
formance  of  the  former.  The  cross-correlation  matrix  P  in  this  case  is; 

‘7  -1  3  3  3  ■ 

^  -1  7-1  3-1 

V  =  -  3-1  7-1-1 

3  3-17-1 

3  -1  -1  -1  7 
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In  Fig.  5,  we  depict  the  result  of  error  probability  of  user  1  having  SNR— 8dB  as  a  function 
of  the  SNR  of  the  other  four  users  (taken  to  be  the  same).  These  results  are  obtained  by 
simulations  and  computation  with  the  approximation  suggested  in  Eq.(9).  For  comparison, 
the  results  of  the  decorrelating  detector  are  also  included.  Fig.  6  is  the  same  except  for 
user  1,  where  SNR=12  dB.  Finally,  Fig.  7  depicts  the  error  probabilities  of  user  1,  whose 
interferences  are  determined  by  the  first  column  of  "P  and  having  the  same  power  as  the 
other  one  to  four  interfering  users.  For  comparison  purposes  we  also  show  the  results  ob¬ 
tained  by  using  the  minimum  power  criterion  and,  as  reference,  those  obtained  when  using 
the  decorrelating  detector.  It  was  established  that  the  case  of  equal  user  power  represents 
approximately  the  worst  case  scenario  for  user  1.  Also  notice  from  V  that  other  users  suffer 
lower  levels  of  interference  by  other  users. 

V.  CONCLUSION 

An  adaptive  interference  canceler  for  the  multiuser  CDMA  system  is  proposed  in  this  paper. 
For  the  control  of  its  adaptive  weights,  we  suggested  using  a  special  form  of  the  outputs’ 
decorrelating  algorithm,  customarily  used  in  neural  networks  and  other  applications  of  blind 
signal  separation  and  equalization.  It  was  shown  that,  particularly  for  a  high  level  of  interfer¬ 
ing  signals,  the  canceler  outperforms  the  decorrelating  detector,  which  is  regularly  suggested 
as  a  mean  for  combating  the  near-far  problem  without  excessive  hardware  complexity.  Al¬ 
though  it  is  difficult  to  tackle  the  algorithm  analytically,  an  approximation,  suggested  in  this 
paper,  made  the  task  possible.  Both  the  algorithm  in  this  paper  and  the  minimum  output 
powers  algorithm,  previously  proposed  by  the  authors,  depict  similar  error  probability  per¬ 
formance,  satisfactory  for  candidacy  in  controlling  adaptive  interference  cancelers  for  CDMA 
systems.  Although  we  assume  synchronous  data  signals  in  both  cases,  extension  to  the  asyn¬ 
chronous  case  is  also  possible.  Examination  of  the  performance  of  these  two  cancelers  for 
dispersive  channels  and,  in  particular,  the  difference  in  behavior  of  the  two  algorithms  in 
such  situations  is  the  topic  of  current  research. 
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Fig.  1  Proposed  multiuser  receiver  scheme 
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SNRi-SNRl  [dB],  i=2. 3. 4. 5 

Fig. 5  Probability  of  error  for  user  1  (K=5,  SNRi  =  8  dB,  Gold  codes) 


SNRi-SNRl  [dB),  1=2, 3, 4,5 

Fig. 6  Probability  of  error  for  user  1  (K=5,  SNRi  =  12  dB,  Gold  codes) 


SNRi  i=l,2.3,4.5 

Fig. 7  Probability  of  error  for  user  1  (K=5,  Gold  codes) 


APPENDIX  B:  PART  IV 


ADAPTIVE  MULTIUSER  CDMA  DETECTOR 
FOR  ASYNCHRONOUS  AWGN  CHANNELS 


by 

Zoran  Siveski,  Lizhi  Zhong  and  Yeheskel  Bar-Ness 

ABSTRACT 


A  multiuser  detector  in  an  additive  white  Gaussian  noise,  code  division  multiple  access 
channel  is  proposed  and  analyzed.  The  detector  utilizes  the  decorrelated  tentative  decisions 
obtained  in  the  first  stage  to  estimate  and  subtract  multiple  access  interference  in  the  second 
stage.  The  novel  feature  of  the  detector  is  the  adaptive  manner  in  which  the  multiple  access 
interference  estimates  are  formed,  which  renders  prior  estimation  of  the  received  signal  am¬ 
plitudes  uneccessary.  The  detector  is  near-far  resistant,  and  is  shown  to  provide  substantial 
performance  improvement  over  the  conventional  and  decorrelating  detector,  particularly  in 
the  presence  of  strong  interfering  signals. 

I.  PRELIMINARIES 

In  a  multiuser  environment,  K  users  share  the  same  channel  with  the  unit  energy  signa¬ 
ture  waveform  Sk{t), 

=  1, . . .  A,  of  the  duration  T  assigned  to  each  of  them.  The  information  bits  bk{i)  €  {±1} 
for  the  symbol  interval  i  have  the  same  duration  T.  The  waveform  r[t)  at  the  input  of  the 
receiver  which  has  a  bank  of  matched  filters  as  its  front  end,  is  expressed  as: 

-■(()  =  EE*'  -iT  -  Tk)  +  n{t) 

k=l  i 

where  n{t)  is  a  zero-mean,  white  Gaussian  noise  with  the  two-sided  power  spectral  density 
Ao/2,  and  Ofc  and  are  the  received  energy  and  relative  delay  for  user  k,  respectively.  While 
it  is  assumed  that  precise  relative  delay  estimates  are  available  for  all  users,  their  amplitudes 
are  considered  to  be  unknown  to  the  receiver.  In  reality,  they  are  slowly  time-varying; 
however,  here  they  are  assumed  to  remain  unchanged  over  the  transmission  horizon  for  each 
user. 

Without  a  loss  of  generality,  the  attention  will  be  on  detection  of  bit  0  of  user  1,  and  it 
will  be  assumed  that  0  =  n  <  T2  <  . . .  <  tk  <  T.  The  sampled  output  of  the  matched  filter 
for  user  1  is  then: 

K 

a;i(0)  =  +  X]  \Afc  +  Pikbki^)]  +  ^i(O) 

k=2 
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The  normalized  partial  crosscorrelations  pk\  and  pn,  for  k  =  2, .  .  . ,  K  are: 

rT  rT 

Pki  =  /  s-i_{t)sk{t +  T  -  Tk)dt,  pik=  /  Si{t)sk{t  -  Tk)dt 
Jo  Jo 

Also,  ni(0)  =  n(t)si{t)dt  is  a  zero-mean  Gaussian  random  variable  with  variance  A^o/2 
Using  the  vector  notations  where  =  [p2i,  ■  •  • ,  Pki,  P12,  •  •  • ,  ^>i(0)  =  [62  (-1),..., 

^k(-1),&2(0),  ...  ,^»A'(0)]  ,  Ai  =  diag[y/d^, . . . ,  y/d^],  and  A  =  ^  ^  ,  the  matched 

filter  output  is: 

3^1(0)  =  A  Pi  Abi{0)  +  rai(O)  (1) 

The  multistage  detector  forms  an  estimate  of  the  multiuser  interference  as  the  weighted 
vector  of  tentative  decisions  on  symbols  that  interfere  with  bi{0)  directly,  and  subtracts  it 
from  the  matched  filter  output.  The  final  decision  statistics  pi(0)  and  the  corresponding 
final  decision  for  bit  0  of  user  1,  6i(0),  are: 

Pi(0)  =  a:i(0) -mf (0)61(0),  6i(0)  =  sgn(yi(0))  (2) 

where  6i(0)  =  [62(— 1), . . . ,  ^k(~1),  ^2(0),  •  •  • ,  ^K'(O)]^  is  the  vector  of  tentative  decisions 
affecting  bit  0  of  user  1,  and  lUi(O)  =  [w2i^\  . . . ,  ■  ■  ■  1  '^kiV  ^^.e  corresponding 

weights. 

Such  a  detector  was  first  proposed  in  [1]  and  it  utilized  fixed  weights  that  required  prior 
estimation  of  the  received  signals’  amplitudes.  Several  classes  of  similar  multiuser  detectors 
that  use  soft-decision  tentative  statistics  were  proposed  and  analyzed  in  [2]. 

The  multistage  detector  proposed  here  operates  without  the  knowledge  of  the  received 
signals’  amplitudes,  and  without  the  need  to  perform  their  prior  estimation  in  order  to  set 
the  values  of  the  weights  lUi(O).  Instead,  the  weights  are  determined  in  an  adaptive  manner. 

11.  MULTISTAGE  DETECTOR  ANALYSIS 

In  this  paper,  1-shot  decorrelated  tentative  decision  statistics  are  considered.  This  decor¬ 
relator,  described  in  [3]  for  a  two-user  case,  can  be  extended  in  a  straightforward  manner 
for  an  arbitrary  K.  The  vector  of  the  decorrelator  outputs  Zi(0)  and  the  corresponding 
tentative  decisions  affecting  bit  0  of  user  1  61  (0)  are: 

zi(0)  =  A6i(0) +  ^i,  61(0)  =  sgn(zi(0))  (3) 

where  ^1  =  [i^2(— 1),  •  • . ,  ^A'(  — 1), ^2(0), . . .  ,^fi:(0)]^  is  a  zero-mean  Gaussian  vector  having 
the  covariance  matrix  S 1 ,  whose  diagonal  elements  are  denoted  by  (t|^  . 

For  controlling  the  weights,  a  steepest  descent  algorithm  which  minimizes  the  output 
signal  power  E{yl{0)}  is  used.^  The  weights  that  achieve  this  are  obtained  by  iterative 

®The  synchronous  version  of  the  detector,  and  its  stability  and  convergence  analysis  were  presented  in  [4] 
and  [5]  respectively. 
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search: 

=  t£?i(0)  + /ii;{yi(0)6i(0)}  (4) 

That  is,  the  algorithm,  by  minimizing  the  output  power,  forces  the  correlation  between 

the  desired  output  signal  and  vector  of  tentative  decisions  of  interfering  signals  to  zero.  In 
practice,  the  expectation  operation  is  replaced  by  time  averaging. 

The  steady  state  values  of  the  weights  are  evaluated  as  follows: 

=  £{i,(0)6,(0))-f;{i,(0)d(0))«;.(0)  (5) 

It  is  easy  to  show  that  .E{ni(0)4(i)}  =  0,  for  /  1,  z  =  -1,0,  so  the  first  of  the  two 

expectations  above  is: 

E{xi(0)6a(0)}  =  AE{fei(0)6[(0)}p,  (6) 

Clearly,  £^{6i(0)6[(0)}  is  diagonal;  therefore,  the  system  of  K  -  1  linear  equations  (5), 
together  with  (6),  gives  the  steady  state  values  of  the  weights  affecting  the  first  output  as: 

u,,(0)=  [£{6,(0)6ho))]'’^£^{*’i(0)d(0)}Pi  (7) 

The  expressions  for  the  expectations  in  (7)  are  shown  in  the  Appendix. 

The  output  error  probability  is  evaluated  as  follows: 

Pe. 

=  h.(0).6,(0),6,(o)^’'hi  merror|6.(0).6,(0),6,(0)} 

=  t  ^  [Fr{ni(0)  > 

6i(o),6i(o) 

+  inf  (0)6i(0)}  +  Pr{ni(0)  <  -y/a^  -  pf  A6i(0) 

+  »p0)6i(0)}]  Fr{6,(0)|6,(0))Pr{6,(0)} 

Since  Pr{6i(0)}  =  and  the  above  expression  contains  pairwise  identical  terms  (see 

(A3)  in  the  Appendix),  it  can  be  written  as: 

P,  =  E  [Pr{nx(0)  > 

6i(o),6i(o) 

-pfA6i(0)  +  inf(0)6i(0)}]  Pr{6i(0)|6i(0)} 
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Finally: 


III.  NUMERICAL  EXAMPLES  AND  DISCUSSION 

In  all  the  examples,  the  signal-to-noise  ratio  for  user  k  is  defined  as  SNRk  —  afc/Ao. 
Figure  1  shows,  for  two  asynchronous  users,  the  probability  of  error  of  user  1  versus  SNRi. 
The  relative  energy  ei  is  defined  as  =  jj^s\[t)dt.  The  case  labeled  (1)  in  the  figure 
corresponds  to  a  relatively  weak  level  of  interference.  It  describes  a  rather  unfavorable 
scenario  for  this  multistage  detector  due  to  tentative  decisions  which  are  not  too  reliable.  This 
results  in  the  performance  of  the  detector  at  higher  values  of  SNRi  to  be  somewhat  inferior 
to  the  decorrelator,  whose  performance  is  insensitive  to  the  level  of  interference.  However, 
the  multistage  detector  outperforms  the  conventional  detector.  When  the  interference  is 
stronger  (case  (2)),  due  to  the  reliable  tentative  decisions,  the  multistage  detector  by  far 
outperforms  the  other  two. 

Figure  2  shows  the  probability  of  error  versus  the  relative  interference  energy  for  the 
fixed  SNRi.  The  multistage  detector  clearly  outperforms  the  decorrelator,  and  achieves  the 
single-user  bound  for  the  relative  interference  level  above  5  dB. 

Gold  sequences  of  length  7  [6,  Fig.  4]  are  used  as  the  signature  sequences  in  the  next 
examples.  Figure  4  shows  the  probability  of  error  for  user  1  when  its  energy  and  the  energy 
of  the  interferer  are  the  same.  The  worst  case  and  the  average  error  performance  over  the 
values  of  the  relative  delay  T2  are  shown.  In  this  scenario  the  multistage  detector  outperforms 
both  the  conventional  and  the  decorrelating  detector,  and  its  average  performance,  due  to 
the  good  crosscorrelation  properties  of  the  two  signature  sequences  used,  is  very  close  to  the 
single  user-bound. 

Figure  4  shows  the  error  probability  of  user  1  as  a  function  of  relative  delays  T2  and  Ts, 
with  the  SNR's  of  all  three  users  equal  to  12  dB.  The  probability  of  error  of  the  multi¬ 
stage  detector,  averaged  over  all  delay  pairs  is  7  x  10“®.  (The  corresponding  values  for  the 
decorrelator  and  the  conventional  receiver  are  9.4  x  10“'*  and  1.2  x  10“^,  respectively.) 


And  finally,  Figure  5  shows  the  probability  of  error  versus  the  relative  interference  ener¬ 
gies  for  the  SNRi  fixed  to  8  dB,  and  T2  =  T/T,  rg  =  5T/7.  Again,  the  multistage  detector 
performs  better  than  the  decorrelator  and  approaches  the  single  user  bound  as  the  interfer- 
ers’  energies  increase. 

IV.  CONCLUSION 

A  two  stage  multiuser  detector  is  proposed.  Its  novel  feature  is  the  adaptive  way  in  which 
the  multiuser  interference  estimates  are  formed.  This  detector  does  not  require  prior  esti¬ 
mation  of  the  received  signal  amplitudes,  and  is  therefore  expected  to  perform  better  under 
the  channel  amplitude  varying  conditions.  The  steady  state  error  performance  is  much  bet¬ 
ter  than  that  of  the  conventional  receiver,  and  in  most  cases  outperforms  the  decorrelating 
detector  as  well. 

V.  APPENDIX 

The  elements  on  the  main  diagonal  of  E  |6i(0)6f  (0)|  are  equal  to  unity,  while  the  off- 
diagonal  ones  are: 

E  {kmu)} 

=  E  {sgn  {^/a~khk{i)  +  6(0)  sgn  (\/aI0(i)  +  6(i))} 

=  \  Y.  [^^^{6(0  <  v^6(0^  6(i)  < 

-  Pr{6(0  <  V^h{i) ,  6(i)  > 

=  — 1,0  k,l  =  2,...,K  k^l 

Each  probability  term  in  the  above  summation  defines  four  integrals.  Therefore: 

EMmm  =  Off  x:  /  /  (m) 

where  denotes  the  bivariate  Gaussian  density  function  of  random  variables  6(0 
and  Dm  is  an  appropriate  rectangular  region  of  integration. 

The  diagonal  elements  of  E  |6i(0)6j  (0)|  are: 

k  =  2,...,K  i  =  -l,0  (A2) 

with 
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By  definition: 


Pr{6x(0)|6i(0)} 

=  Pr {sgn  {^/a^b2{-l)  +  C2i-l))  ,■■■  ■, 

sgn  {^/a^bK{-l)  +  ^A'(-l)) ,  sgn  (v^&2(0) 
+6(0)) ,  •  •  •  ,sgn  (V^6a'(0)  +  ^a(O))} 

=  Pr{6(-i)6(-i)  >  -x/^6(-i)6(-i), . . . , 

6<(-1)<^a-(-1)  >  -^/(>l<bK{-^)bKi-l), 

6(o)6(o)  >  — ■s/^6(o)&2(o),  . . . , 

^a'(0)6<(0)  >  — aA^^a'(0)&a'(0)} 

=  Pr{6(-i)6(-i)  <  ^A26(-l)6(-l),  •  •  • , 

^a(-1)(a'(-1)  <  \/^bK{-l)bK{-l), 

6(o)6(o)  <  \/«26(o)62(o),  . . . , 

^a:(0)^a'(0)  <  •v/aA6\'(0)6A'(0)} 

=  Pr{-6i(0)|-6i(0)}  (A3) 
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Figure  1:  Error  probability  of  user  1  for  K=2,  p\2  —  0.2,  p2i  —  0.6,  ei  =  0.4  (1)  02/01  =  0.6 
(2)  01/02  =  0.6 


Figure  2:  Error  probability  of  user  1  for  K=:2,  SNR\  =  8  dB^  pi2  =  0.2,  p2i  =  0.6,  ei  =  0.4 


(w)  worst  case 
(a)  average 


APPENDIX  B:  PART  V 


ADAPTIVE  MULTIUSER  BOOTSTRAPPED  DECORRELATING  CDMA 
DETECTOR  IN  ASYNCHRONOUS  UNKNOWN  CHANNELS 


by 

Yeheskel  Bar-Ness,  David  W.  Chen  and  Zoran  Siveski 


ABSTRACT 


For  PIMRC  uplink  multiuser  CDMA  channels,  received  signals  are  time  asynchronous, 
and  suffer  from  amplitude  fading.  The  relative  signals’  bit  delays,  energy  variations,  and 
hence  the  cross-correlation  matrix  of  the  received  signal,  are  unknown.  Using  a  novel  adap¬ 
tive  algorithm,  we  propose  in  this  paper  a  decorrelating  detector  suitable  for  particular 
CDMA  uplink  channels.  It  is  near-far  resistant,  and  can  be  used  by  itself  if  signal-to-noise 
ratios  are  high,  or  as  a  first  stage  in  the  two-stage  adaptive  canceller  previously  proposed  for 
the  synchronous  CDMA  case. 

I.  INTRODUCTION 

Multiuser  Code  Division  Multiple  Access  (CDMA)  schemes  have  become  a  popular  ap¬ 
proach  for  personal,  indoor,  and  mobile  wireless  radio  communications  (PIMRC).  Con¬ 
ventional  single-user  detectors  implement  a  bank  of  filters,  each  matched  to  the  signature 
sequence  of  the  particular  user.  The  sampled  output  of  each  matched  filter  contains  residual 
interference  from  all  other  users.  In  some  applications,  such  as  military,  the  bandwidth  is 
not  so  limited,  so  that  processing  gain  might  be  very  high  and  the  number  of  users  small 
enough  to  result  in  a  low  level  of  controllable  interference  power.  In  PIMRC  applications 
bandwidth  is  scarce  and  the  number  of  users  is  large  and  continually  increasing.  Particularly 
for  uplink  cellular  systems,  the  level  of  interference  can  be  very  large  (the  near-far  problem); 
communication  is  asynchronous;  and  multipath  reception  is  usual. 

To  handle  this  difficult,  time  varying  communication  environment,  multiuser  detectors 
of  different  structures  have  been  proposed  by  many  researchers.  Some  [1]  are  optimal,  com¬ 
pletely  eliminating  the  near-far  problem,  although  suffering  from  computational  complexity. 
Others  [2,3]  only  suboptimal,  assume  the  knowledge  of  the  cross  correlation  matrix  of  the 
different  user  signature  sequences  and  linearly  transform  the  output  of  the  matched  filter  to 
obtain  uncorrelated  signals  on  which  data  decisions  are  made. 

Another  approach  for  suboptimum  multiuser  detectors  with  low  complexity  was  proposed 
in  [4]  and  [5].  The  performance  of  some  of  these  suboptimum  schemes  is  close  to  the  per¬ 
formance  of  the  optimum  detector.  Particularly  when  the  power  of  the  interferers  increases, 
they  become  indistinguishable.  However,  these  schemes  have  to  perform  an  estimation  of  the 
received  signal  energies,  knowledge  of  which  is  required  for  the  detector’s  proper  operation. 
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At  the  Center  for  Communications  and  Signal  Processing,  research  for  improving  the  per¬ 
formance  of  multiuser  CDMA  detectors  was  directed  toward  using  conventional  and  novel 
adaptive  algorithms  for  interference  cancellation  and  signal  separation.  For  example,  in  [6] 
and  [7],  the  two-stage  multiuser  CDMA  detector  was  used  in  succession  with  linear  trans¬ 
formation,  decorrelation,  pre-decision  (hard  limiter),  followed  by  an  adaptive  interference 
canceler  which  used  the  minimum  power  criterion.  Its  error  performance  was  studied  and 
shown  to  be  significantly  better  than  the  decorrelating  detector.  A  convergence  analysis  of 
this  adaptive  detector  can  be  found  in  [8].  Further  improvement  in  performance,  particu¬ 
larly  at  a  low  interference-to-desired  signal  ratio,  was  obtained  when  the  pre-decision  was 
obtained  from  a  soft-limiter  [9]. 

The  adaptive  interference  canceler  of  the  previously  mentioned  scheme  was  controlled 
by  a  different  algorithm  called  the  “bootstrapped  decorrelating  algorithm”  in  [10,11,12]. 
The  idea  of  using  this  algorithm  is  to  eliminate,  in  future  study,  the  need  for  knowing  the 
cross-correlation  matrix  of  the  different  signals  at  the  output  of  the  matched  filter’s  bank. 

Most  of  the  aforementioned  multiuser  detectors,  adaptive  or  non-adaptive,  assume 
a  synchronous  data  stream  of  all  users  and  knowledge  of  the  cross-correlation  matrix.  A 
suboptimum  multiuser  detector  suitable  for  asynchronous  multiuser  CDMA  environment 
was  first  proposed  in  [4].  It  utilized  fixed  weights  that  required  prior  estimation  of  the 
received  signals’  amplitudes.  Several  classes  of  similar  multiuser  detectors  that  use  soft- 
decision  tentative  statistics  were  proposed  and  analyzed  in  [llj. 

In  a  companion  paper  [12]  the  1-shot  decorrelating  tentative  decision  statistics  of  Verdu 
[13]  are  used.  Assuming  that  the  bit-delay  overlap  between  data  streams  of  the  different  users 
is  known,  and  using  an  AWGN  non-faded  channel,  the  cross-correlation  matrices  with  zero 
and  one  bit-delay  can  be  calculated  and  used  in  a  minimum  power  type  adaptive  canceler 
similar  to  the  one  used  with  the  synchronous  channel. 

For  the  case  of  the  faded  channel,  with  unknown  energies  of  the  received  signal  from 
different  users  and  unknown  bit-delays  of  these  signals,  the  cross-correlation  is  unknown. 
For  this  case,  which  is  typical  for  uplink  PIMRC,  we  propose  in  this  paper  an  adaptive 
decorrelating  detector.  It  might  be  used  as  a  first  stage  for  the  two-stage  canceler  proposed 
in  [6]  and  further  studied  in  [7],  or  by  itself  if  the  SNR  is  high  enough  so  that  merely  the 
decorrelating  detector,  near-far  resistant  scheme  is  satisfactory. 

After  the  asynchronous  decorrelating  model  is  presented  in  the  next  section,  the  equation 
of  the  output  for  the  multiuser  case  is  shown  as  a  function  of  the  adaptive  weights.  A  boot¬ 
strapped  decorrelating  algorithm  for  controlling  these  weights  is  proposed.  For  the  two-user 
case  we  find  these  weights  analytically  and  calculate  the  corresponding  outputs.  We  find  the 
error  probability  analytically  and  depict  it  as  a  function  of  the  interference-to-desired  signal 
ratio.  Results  of  some  simulations  are  shown. 

II.  THE  ADAPTIVE  DECORRELATOR:  MODEL  AND  ANALYSIS 

The  proposed  asynchronous  CDMA  adaptive  decorrelator  consists  of  a  bank  of  matched 
filters  (matched  to  the  users’  signature  sequences),  followed  by  a  sampler  at  the  bit-rate  1/T 
and  controllable  weights  as  shown  in  Fig.  1. 
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Let  K  users  share  the  same  channel  with  the  unit  energy  signature  waveform  Sk{t), 
ic  =  1, 2,  •  ■  • ,  /L  of  duration  T  assigned  to  each  one.  The  received  input  r{t)  can  be  expressed 
as: 

bk{i)y/^Sk{t  -  iT  -  Tk)  +  n{t)  (1) 

k=l  i 

where  bk(i)  G  {+I5  ■“!}  ^he  information  bits  of  the  kth  user  at  interval  i  whose  duration 
is  T.  n{t)  is  a  zero  mean,  white  Gaussian  noise  with  spectral  density  No/2,  ak  and  Tk  are  the 
received  energy  and  relative  delay  for  user  k.  They  are  unknown  to  the  receiver  and  may  be 
taken  as  random  variables.  The  variation  of  ak  depicts  the  amplitude  fading  characteristic  of 
the  different  users.  Tk  may  vary  with  the  change  of  the  user  s  location.  In  fact  each  matched 
filter  has  only  to  lock  onto  the  sequence  of  the  corresponding  signal.  Thereafter  will  be  no 
need  to  know  accurately  the  relative  delays.  The  values  of  these  realtive  delays  will  enter 
into  the  unknown  cross-correlation  matrices  defined  in  the  sequel.  The  adaptive  algorithm 
will  follow  changes  in  cross  correlation  matrices  caused  by  variations  in  relative  delays,  as 
well  as  amplitude  variation  due  to  fading.  Let  us  assume  that  we  have  ordered  the  delays 

such  that  — r/2  <  ti  <  T2  <  •  ■  ■  <  tk  <  T /I. 

The  sampled  output  of  the  matched  filter  for  user  m  at  time  i  (see  Figure  2)  is: 

-  K 

bk{i)^/^kSk{t  -iT  -  Tk)  +  n{t) 

.k=l 
m— 1 

“  y/ (0  +Y  V^ipmkbkii)  +  Pkm’bkii  +  1) 

fc=l 

+  "Y  y/^iPmkbkii)  T  p]cm^bk{i  ~  1)  +  (2) 

k=m+l 

where:  pmk  =  lo  Sm{t)sk{t  -  iT  -  Tk)dt 

ptrl^^  Sm{t)sk{t  ~{i  +  l)T  -  Tk)dt  k  <m 

Pkm^^  ^m{t)sk{t  -{i-  l)T  -  Tk)dt  k>m  (3) 

and  without  loss  of  generality,  we  assume  zero  delay  for  the  corresponding  receiver  matching 
code.  We  also  normalize  pmm  so  that:  pmm  =  fo  —  iT  —  Tm)dt  =  1.  Also  njn{i)  = 

Iq  n{t)sm{t)dt  is  a  zero-mean  Gaussian  random  variable  with  variance  No/2. 

In  matrix  notation  (2)  can  be  written  as: 

x{i)  =  A[{I  +  V)b{i)  +  Plb{i  +  l)  +  Tlb{i-l) 

+  n{i)  (4) 

where:  x(i)  —  [a:i(i),  X2{i), . . . ,  XK{i)]^ 
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b{i)  =  [bi{i), . . . ,  6A-(^)]^,  A  =  diag[y/a^,  ■■■ , 


■  0 

PI2 

■  P\K 

P21 

0 

■  P2K 

.  PK\ 

PK2  ■  ■ 

■  0 

0 

(+1) 

P\2 

0 

dF 

--  p&d 

..  0 

"Pl  and  P[/  are  the  lower  and  upper  triangular  matrices  of  P  respectively,  and  I  is  the  unit 
matrix. 

To  generate  the  decorrelator  input  will  weigh  Xmii),  rn  =  !,•••,  K  such  that: 

m— 1 

ym{i)  =  Xra{i)  “  {'^kmXk{i)  +  WkmXk{i  +  l)) 


'y  ^  “H  ^kmX k{'^  1)) 

k—m+l 

or  in  matrix  notation: 

y{i)  =  x{i)  —  W'^x{i)  -  W^x{i  -  1)  -  W^x{i  +  1) 
where:  y{i)  =  [yi{i),y2{i), . . .  ,yKii)V 


W  = 


0  Wi2 

W21  0 


w  = 


0  W-12 

W2l  0 


and  W L  and  Wu  are  the  lower  and  upper  triangular  matrices  of  W  respectively,  after 

decision: 

b(i)  =  sgn[y{i)]. 

For  controlling  the  weight  matrices  W  and  W,  we  use  the  steepest  descent  algorithm, 
which  simultaneously  reduces  the  absolute  value  of  the  correlation  between  the  decorrelator 
output  ym{i)  and  all  other  outputs  of  the  decorrelator  (after  decision)  at  time  i,  z  —  1  and 
i  +  1  respectively.  The  weights  and  Wmk  for  all  1  <  A:,  m  <  A',  k  ^  m  axe  controlled  by: 

Wmk  Wmk  +  yE{ym{i)sgn{yk{i))} 
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W^k  ^  Wmk  +  liE{ym{i)sgn{yk(i  -  1))}  k>m 

Wmk  ^  Wmk  +  yE{ym{i)sgn{ykii  +  1))}  k  <m  (7) 

Here  we  have  2K{K  -  1)  weights  to  be  controlled,  twice  the  number  we  need  for  the 
synchronous  case  (An  example  of  K=2,  for  finding  the  steady  state  values  of  Wi2i  W211  W\2i 
W2\  will  be  given  in  the  next  section). 

In  the  general  case,  the  signal-to-noise  ratios  (SNR  s)  are  large  enough  such  that  the 
main  contribution  to  the  output  errors  is  due  to  the  multiuser  interference.  Therefore. 

E{ym{^)^g'^{yk{^))] 

E{l/™(^)fefc(^)}(l  -  2Pr(sgn(yfc(f))  in  error))  (8) 

and  similarly  for  the  other  terms.  With  these  approximations  whose  validity  was  examined 
in  [10],  we  can  find  the  steady  state  values  of  Wmk  and  Wmk  1  <  k,m  <  K ,  m  ^  khy  equating 
E{ym{i)bk{i)},  E{ym{i)bk{i  -  1)}  and  E{ym{i)bk{t  +  1)}  to  zero,  respectively. 

III.  THE  TWO-USER  CASE 

For  this  case  (2)  becomes: 

Xi{i)  =  \/a\bi{i)  +  pi2\/^b2{i)  +  P2i\/^^2(*  —  1)  +  ?^i(0 

X2{i)  =  \/^b2{i)  +  Pi2\/Eibi{i)  +  P2i^/E[bi{i  +  1)  +  n2{i)  (9) 

and  the  decorrelator  output  ym{i)-,  m  =  1,2  from  (5)  becomes, 

yi{i)  =  Xi(i)  -  Wi2X2{i)  -  W2iX2{i  -  1) 

^2(0  =  ^2(0  ”  W2lXi{i)  -  Wi2Xi{i  +  1)  (10) 

A.  Algorithm 

Correspondingly  equations  (7)  become: 

W12  <-  ^<^12  +  pE{yi{i)sgn{y2{i))] 

W21  ^  W21  +  pE{y2(i)sgn(yi(i))} 

W21  <-  W21  +  pE{y2(i)sg7i(yi(z  +  1))} 

n;22  <-  W22  +  ME{y2(i)sgn(yj(i  -  1))}  (11) 

These  are  four  non-linear  equations  that  must  be  solved  to  get  the  four  different  weights. 
With  the  approximation  of  (8)  the  following  four  linear  equations  should  be  solved  instead. 

E{yi(z)b2(z)}  =  0  E{y2(^)&l(^)}  ==  0 

E{yi(z)b2(z  -  1)}  =  0  E{y2(iMi  +  1)}  =  0  (12) 
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B.  Optimal  weights 
Equation  (12)  leads  to: 

Pl2V^“  ^21\/a2  =  0  =>  W21  =  Pl2 

/3l2^/al  “  =  0  Wi2  =  P12 

P2i\/d2  -  hi2iV^  =  0  h;2i  =  P21 

P21a/^  -  hil2\Al  =  0  t0l2  =  P21  (13) 

C.  Optimal  Outputs 

Substituting  (13)  in  (10)  together  with  (9)  we  get: 

yi{i)  =  \/ai^i(*)(l  -  P?2  -  d2i)  -  di2p2i\/a7^i(«  -  1) 

—  Pl2P21\/a7^l(*  +  1)  +  1^1  (14) 

where  ,^1  =  ni(i)  -  pi2?^2(0  -  p2in2{i  -  1) 

Noting  that  ni(z),  n2{i)  are  correlated,  we  calculate  in  Appendix  A  al  as:  =  (1  - 

dl2  “  pIi)^ 

The  output  yi{i)  from  eq.(15)  depicts  the  fact  that  in  the  steady  state  the  decorrelator 
output  is  multiuser  interference  free,  and  hence  it  is  near- far  resistant.  However,  it  contains 
inter-symbol  interference  (ISI).  The  latter  can  be  eliminated  by  a  suitable  equalizer  custom¬ 
arily  used  with  fading  channels. 


D.  Probability  of  Error 

The  probability  of  error  of  detecting  bi{i)  from  yi{i)  is  given  (without  the  equalizer)  by 
(See  Appendix  B)  as; 


+  Q 


(15) 


IV.  NUMERICAL  AND  SIMULATION  EXAMPLES 


Fig.  3  depicts  the  probability  of  error  of  Eq.(15).  A  constant  SNRi  of  8dB  was  used, 
pi2  =  0.6,  and  p2i  =  0.2  was  assumed  in  these  computations,  SNR2  was  varied.  For  compar¬ 
ison  this  curve  contains  the  results  of  simulation  with  and  without  equalization.  In  Fig.  4 
we  show  the  error  probability  of  user  one  when  SNRi  and  SNR2  are  varied  from  4  -  20  dB. 
For  comparision  the  curve  contains  the  result  of  computation  when  the  cross-correlation  is 
known  and  linear  transformation  [13]  is  used. 

V.  CONCLUSION 

An  adaptive  multiuser  decorrelating  detector  for  asynchronous  CDMA  was  proposed  in 
this  paper.  The  signals’  energies  and  the  relative  delays  between  the  data  streams  are  as¬ 
sumed  unknown.  Hence,  it  is  suitable  as  a  model  for  asynchronous  fading  PIMRC  uplink 
channels.  Using  a  novel  bootstrap  blind  decorrelating  algorithm,  it  is  shown  that  its  steady 
state  leads  to  a  total  near-far  resistant  decorrelation.  Computation  as  well  as  simulation 
support  this  conclusion.  Although  the  two-user  case  was  used  as  an  example,  it  can  be 
extended  to  the  multiuser  case.  The  decorrelating  detector  can  be  used  as  a  first  stage  for 
the  two-stage  canceler  previously  proposed  [6-7]  when  dealing  with  uplink  PIMRC  channels. 


VI.  Appendix  A 


-  2pi2n\{i)n2{i)  —  2p2\nx{i)n2{i  -  1) 

+  2pi2P2\ri2{i)n2{i  -  1)} 

E{ni[i)n2{i)]  =  Pi2^,  E{ni{i)n2{i  —  1)}  =  p2\^i  and  E{n2{i)n2{i  —  1)}  =  0.  Hence: 

VII  Appendix  B 


Pei  =  ^Eb,(i-i)b^(i+i){Pr{yi{i)  >  0\bi{i)  = -1} 

-F  Pr{j/i(z)  <  0|6i(i)  =  1)} 

=  —  Pi2  —  P21) 

+  Pi2P2i'/^W{'i-  -  1)  +  Pi2p2iy/^bi{i  +  1)}} 

=  >  V^(l  “  /’12  -  -  2pi2P2l)} 

+  2Pr{^-i  >  —  p^2  ~  P21)} 

T  Pr{^i  >  ■^/ai(l  —  pi2  ~~  P21  T  2pi2P2i)}) 
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+ 

+ 


2Q 


Q  ( ~  ^12  ~  P2I  ~  2p\2P2l)\ 


V 


\/^(l  Pl2  P21) 


Q  [ "  Pl2  ~~  P2I  +  2pi2P2l) 

^  V 
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Figure  1:  Proposed  detector 
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Figure  2:  Asynchronous  data  bits 
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Figure  3:  Probability  of  error  of  user  1  for  SNRi  =  8  dB,  pi2 
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CONVERGENCE  AND  STABILITY  ANALYSIS  OF  A  SYNCHRONOUS 
ADAPTIVE  CDMA-BASED  PCS  RECEIVER 


by 

Bin  Zhu,  Nirwan  Ansari,  Zoran  Siveski  and  Yeheskel  Bar-Ness 


ABSTRACT: 

A  thorough  investigation  on  the  convergence  and  stability  of  a  recently  proposed  adaptive 
synchronous  CDMA  receiver  is  presented  in  this  paper.  The  receiver  consists  of  a  decorre¬ 
lator  at  the  first  stage  and  an  adaptive  interference  canceler  at  the  second  stage.  By  using 
a  steepest  descent  algorithm  to  adaptively  control  the  weights,  the  knowledge  of  the  users’ 
received  powers  is  no  longer  required.  It  was  shown  that  the  system  has  good  “near-far” 
resistance,  and  can  approach  optimum  performance  when  the  users’  SNRs  are  high.  Suffi¬ 
cient  conditions  for  the  receiver  to  achieve  convergence  are  derived,  and  their  properties  are 
analyzed. 

I.  INTRODUCTION 

An  optimum  receiver  in  the  multiuser  environment  which  performs  significantly  better  than 
the  conventional  receiver  and  also  eliminates  the  infamous  near-far  problem,  albeit  at  the 
expense  of  high  computational  complexity,  was  first  proposed  in  [1].  Several  suboptimum 
receivers  have  been  proposed  to  achieve  the  reduced  computational  complexity  at  the  cost  of 
lower  performance  [2],  [3],  The  suboptimum  two-stage  decorrelating  receiver  analyzed  in  [3] 
is  one  of  them.  The  performance  of  some  of  these  suboptimum  schemes  is  close  to  the  per¬ 
formance  of  the  optimum  detector,  particularly  when  the  power  of  the  interferers  increases, 
they  become  indistinguishable.  However,  these  schemes  have  to  perform  an  estimation  of  the 
received  signal  energies,  knowledge  of  which  is  required  for  the  detectors’  proper  operation. 
In  [4]  the  detector  for  synchronous  code  division  multiple  access  (CDMA)  systems  that  con¬ 
sists  of  a  decorrelator  and  a  nonlinear  multidimensional  interference  canceler  whose  weights 
adaptively  adjust  to  the  incoming  signal  was  proposed,  and  its  steady  state  error  perfor¬ 
mance  analyzed.  It  was  shown  that  it  provides  significantly  better  performance  than  the 
decorrelating  detector.  This  is  especially  true  under  the  most  critical  conditions  in  multiuser 
environments,  such  as  near-far  situations  and  high  bandwidth  efficiency  utilization.  In  the 
presence  of  strong  interference,  the  proposed  detector  was  shown  to  achieve  the  performance 
of  the  optimum  detector.  In  the  presence  of  weak  interfering  signals  it  performs  better  than 
its  non- adaptive  counterpart  in  [3],  one  that  requires  prior  estimation  of  the  received  signal 
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energies. 

This  paper  investigates  the  learning  schemes  employed  in  this  adaptive  detector  and  its  tran¬ 
sient  behavior.  In  Section  II,  the  system  model  is  briefly  reviewed  and  the  learning  scheme 
that  appears  to  be  most  appropriate  is  proposed  and  analyzed.  Simulation  examples  that 
demonstrate  the  fast  convergence  of  the  proposed  scheme  are  presented  in  Section  III. 

II.  DETECTOR:  MODEL  AND  ANALYSIS 

The  adaptive  synchronous  CDMA  receiver  proposed  in  [l]  consists  of  a  bank  of  filters 
(matched  to  the  users’  signature  sequences)  which  comprises  the  front-end,  followed  by  sam¬ 
plers  and  the  decision  system,  as  depicted  in  Fig.  1. 

The  received  signal  r{t)  is  expressed  as: 

K 

r{t)  =  YlYlbk{i)y/^Sk{t -iT)-\-n{t),  (1) 

k=l  i 

where  hk{i)  E.  {  — 1,+1}  is  the  k-th  user’s  data  bit  in  the  z-th  time  interval  and  n{t)  is  the 
additive,  zero  mean,  white  Gaussian  noise  with  the  two-sided  power  spectral  density  No/2. 
The  received  energy  of  the  k-th.  user  signal,  unknown  to  the  receiver,  is  denoted  as  a^.  The 
signature  sequence  Sk{t),  of  the  same  duration  T  as  a  data  bit,  is  known  to  the  receiver.  The 
sampled  outputs  of  the  bank  of  matched  filters  in  the  z’-th  bit  interval  can  be  expressed  as: 

x{i)  =  VAb{i)  4-  n{i).  (2) 

For  the  sake  of  convenience  the  index  z  will  be  omitted  whenever  possible  in  the  text.  In 

(2),  X  =  [.ri, . . . ,  b  —  [6i, . . . ,  6a']^,  A  —  diag[y^, . . . ,  is  a  diagonal  matrix 

and  n  =  [rzi, . . .  The  {k,j)th  element  of  the  symmetric  crosscorrelation  matrix  "P  is 

defined  as:  ^ 

pkj  —  /  k.j j  G  (1,2,..., A^) 

Jo 

with  pkk  =  1-  The  covariance  matrix  of  a  zero  mean  Gaussian  noise  vector  n  is 

E{nn'^}  =  yP 

The  outputs  of  the  decorrelator  are  given  as: 

z  =  P~^®  =  Ab  -f  P“^n  =  Ab  -f-  Fn  =  Ab  +  (3) 

where  F  is  the  inverse  of  P,  and  ^  —  Fn 
the  decorrelator  defined  as: 

b  : 

Then  the  canceler’s  output  is  given  by: 

y  =  X  —  W^i),  (5) 


Let  b  —  1^61, ... ,  be  the  decision  output  of 
sgn(z).  (4) 
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where 


0  Wi2  .  .  .  WiK 
W21  0  ...  W2K 

W=  .... 

WKl  WK2  ■■■  0 

The  output  for  the  A;-th  user  can  be  expressed  as: 

yk  =  Xk-  wjbk,  (6) 

where  Wk  is  the  A:-th  column  vector  of  W  with  the  element  Wkk  deleted,  and  is  the  vector 
obtained  from  6  by  deleting  the  element  bk-  For  controlling  the  weights  we  use  the  steepest 
descent  algorithm,  which  simultaneously  minimizes  the  output  signal  powers  E{yl}.  That 
is,  for  the  k-th  output,  the  optimum  weights  are  obtained  by  an  iterative  search 

d 

-  A  ''T 

=  Wk{i)  +  yE{xkbk  -  bkbk ^^(2)} 

=  {I  -  iiE{bkb\])wk{i)  E  yE{xkbk]-  (7) 

Note  that  E{xkbk]  =  wlE{bkbl).  Let  Ck{i  +  1)  =  -  Wk{i  +  1),  then  equation  (7)  can 

be  expressed  as: 

Ck{i  +  l)  =  {I  -  yE{bkbk})ckii) 

=  {I-yE{bkbl}Y+^Ck{0) 

=  (I-yRkY+^CkiO) 

=  Hi+^Ck{0Y  (8) 

where  both  Rk  =  E{bkbl}  and  Hk  =  I  -  yRk  are  {K  -  1)  x  {K  -  1)  symmetric  matrices. 
Since  bk  G  {-1,1},  which  implies  \E{kbj}\  <1  Vi  ^  j,  the  diagonal  elements  of  Rk  are 
equal  to  I’s  and  off-diagonal  elements  range  between  (  —  1, 1).  Likewise,  diagonal  elements  of 
Hk  are  (1  -  ;u)’s,  and  off-diagonal  elements  range  between  {-y,  y).  Equation  (8)  can  simply 
be  expressed  as: 

Ckii)  =  HickiO).  (9) 

It  is  easily  seen  that  the  training  of  the  weights  converges  if  and  only  if 

.limcA:(i)  =  0,  (10) 

which  is  equivalent  to 

(11) 

i—^oo 

Since  iTfc  is  a  symmetric  matrix,  it  can  be  diagonalized  and  all  of  its  [K  —  1)  eigenvalues 
are  real.  So,  there  always  exists  an  orthogonal  matrix  Qj^  such  that 

Hk  =  QkDkQk\ 
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where  Dk  is  a  diagonal  matrix  Dk  =  diag  [Ai,  •  •  • ,  and  \i,i  =  1,  •  ■  ■ ,  K  —  1  are  the 

(K  —  1)  eigenvalues  of  matrix  Hk-  Thus, 

Ck{z)  =  {QkDlQ-,^)ck{0) 

/  \\  0  \ 

=  Qk  ■■■  Q;'ckm.  (12) 

\  0  / 

It  is  clear  that  the  necessary  and  sufficient  condition  for  the  weights  to  achieve  convergence 
and  stability  is 

I  A,  |<  1,  V^.  (13) 

By  applying  the  Gershgorin  theorem  [5,  p.l46]  to  matrix  Hk,  it  is  easy  to  see  that  matrix 
fffc’s  {K  —  1)  real  eigenvalues  lie  in  at  least  one  of  the  {K  —  1)  disks  on  the  complex  plane 
with  all  their  centers  reside  at  (1  —  n,  0)  and  their  radii  as  rj  (j  —  1,2,  ■  ■  ■ ,  K  —  1),  where  rj 
is  the  sum  of  the  absolute  value  of  each  element  of  either  the  jth  column  or  the  jth  row  of 
matrix  Hk  with  the  diagonal  element  deleted: 

r,  =  or  (>4 

i#j 

Since  the  eigenvalues  are  real  values,  they  can  only  exist  on  the  real  axis  and,  therefore,  all 
the  computations  are  limited  in  real  domain.  The  above  discussion  can  also  be  expressed  as: 

|Ai  —  (1  —  /i)|  <rj  Vi,  for  some  j.  (15) 

Since  the  centers  of  all  the  disks  coincide  at  (1  —  n,  0),  all  eigenvalues  must  lie  in  the  largest 
disk  (the  disk  with  the  largest  radius).  Assume  =  maxjj|e,j|,  where  is  the  (i,  j)th 

A  A  X 

element  of  matrix  E{bkbf.  }  (or  Rk),  which  depends  on  the  signal  SNRs  and  cross-correlation 
matrix  p,  then  rj  <  p.{K  —  Therefore,  all  the  eigenvalues  must  lie  in  the  disk 

with  radius  p{K  —  2l)ek^^^.  That  is: 

(1  -  h)  -  1^{K  -  2)efc^„^  <  Ai  <  (1  -  /i)  +  p{K  -  2)efc^„^.  (16) 

Now  that  the  existing  range  of  the  eigenvalues  of  matrix  Hk  is  known,  to  ensure  convergence, 
the  eigenvalues  must  also  lie  between  (—1,1).  This  can  be  guaranteed  if: 

1.  {1  -  p)  +  p{K  -  2)ejt^^^  <  1,  which  leads  to 

(A^-2)efc_<l. 

This  balances  three  elements  of  the  system:  the  number  of  users  that  can  access  the  sys¬ 
tem  simultaneously,  the  SNRs,  and  the  cross-correlation  of  their  signature  sequences. 

2.  (1  —  /i)  —  p{K  —  2)e)t^^^  >  —1,  which  leads  to 

_ 2 _ 

^  ~  1  +  (/"T  -  2)eA;„,„^ 

This  is  the  condition  on  the  learning  rate  p  for  the  system  to  achieve  convergence  and 
stability.  When  condition  1  is  satisfied,  this  condition  implies  that  p  can  be  any  value 
between  (0, 1). 
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These  two  conditions  are  sufficient  for  the  A:th  signal  to  converge.  It  is  easy  to  verify  from 
equation  (9)  that  Cfc(oo)  =  0  when  the  above  two  conditions  are  satisfied,  i.e.,  the  weights 
will  converge  to  the  same  steady  state  no  matter  what  their  initial  values  are. 

III.  SIMULATION  RESULTS  AND  DISCUSSION 

In  this  section,  two  sets  of  Monte-Carlo  simulation  examples  for  the  transient  behavior  of 
the  adaptive  detector  are  shown.  The  first  one  is  a  two-user  case,  while  the  second  one  is 
a  three-user  case.  In  either  case  the  SNR  of  user  1  is  fixed  at  8  dB,  and  the  SNR(s)  of  the 
interferer(s)  are  varied. 

The  steady  state  error  performance  which  was  analytically  evaluated  in  [4]  is  shown  in 
Fig.  2.  The  crosscorrelation  coefficient  is  equal  to  0.7  and  represents  a  high  bandwidth 
efficiency  situation.  In  order  to  gain  some  insight  into  the  transient  behavior  of  the  system, 
the  adaptation  of  values  of  the  weights  as  a  function  of  time  is  observed.  Fig.  3  depicts 
the  average  value  i  the  standard  deviation  of  W21  that  is  updated  according  to  the  rule  in 
(7).  Three  different  SNR2  values  of  2,  8^  and  14  dB  were  considered.  The  learning  rate  is 
(/X  =  0.2),  and  the  expected  value  £]{xkbk}  is  evaluated  as  a  time  average,  using  a  sliding 
window  of  duration  of  2000  steps  (signal  intervals). 

Fig.  4  shows  the  transient  behavior  of  the  corresponding  error  probabilities  for  the  above 
scenario.  It  is  observed  that  the  steady, state  value  of  the  error  probability  is  achieved  very 
quickly. 

In  the  second  set  of  simulation  examples,  three  users  having  Gold  sequences  of  length  seven 
(frequently  used  in  the  literature  [3])  are  chosen  for  signature  waveforms.  The  rationale 
for  such  a  choice  is  that  Gold  sequences  are  regularly  used  in  an  asynchronous  CDMA 
environment  and  the  study  of  its  proposed  synchronous  counterpart  may  provide  a  useful 
indication  of  the  performance  of  the  former.  The  crosscorrelation  matrix  "P  in  this  case  is: 

7-1  3  ■ 

-1  7  -1 

3-1  7 

The  analytically  evaluated  steady  state  error  performance  of  user  1  is  shown  in  Fig.  5.  Fig. 
7  describes  the  updating  of  the  weights  u;2i  and  uiai,  for  ^  =  0.2.  Again,  as  in  the  two-user 
case,  the  steady  state  values  are  reached  fast,  without  oscillations  afterwards.  Fig.  7  shows 
the  probability  of  error  of  user  1,  the  one  that  is  effected  by  the  weights  u;2i  and  W31.  The 
steady  state  value  is  again  reached  very  quickly. 

IV.  CONCLUSION 

It  has  been  demonstrated  analytically  and  by  simulation  that  fast  and  stable  convergence 
is  achievable  for  the  adaptive  code  division  multiple  access  detector  described  in  [4].  The 
desirable  transient  behavior  of  the  learning  scheme  has  enhanced  the  practicality  of  the  de¬ 
tector,  indicating  that  the  steady  state  performance  can  be  attained  in  a  very  fast  manner. 
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Figure  1:  The  proposed  receiver  scheme 
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Figure  3;  Convergence  of  W2t,  in  the  two-user  case  using  the  proposed  learning  scheme  (7) 


mm 


0  500  1000  1500  2000  2500  3000  3500  4000  4500  5000 

iterations 
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Figure  5;  Steady  state  error  probability  of  user  1  in  the  three-user  case  with  SNRi  =  t 
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Figure  6:  Convergence  of  W21  in  the  three-user  case  using  the  proposed  learning  scheme  (9) 
with  n  =  0.2 


Figure  7:  Error  probability  of  user  1  in  the  three-user  case  with  fi  —  0.2 
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APPENDIX  C:  PART  I 


BLIND  DECISION  FEEDBACK  EQUALIZATION  BASED  ON 

DECORRELATION 

by 

Raafat  Kamel  and  Yeheskel  Bar-Ness 

ABSTRACT 

A  new  blind  equalization  algorithm  is  presented  which  is  based  on  decorrelating  the 
equalizer’s  output.  The  algorithm  is  used  with  a  decision  feedback  equalizer.  The 
performance  of  the  algorithm  is  studied  analytically  and  through  computer  simula¬ 
tion.  The  algorithm  is  shown  to  converge  to  the  correct  channel  parameters  irrespect 
of  the  initial  error  rate.  Upper  and  lower  bounds  on  the  probability  of  error  were 
derived  for  the  decision  feedback  equalizer.  While  the  upper  bound  has  been  known 
for  a  long  time,  the  lower  bound  is  presented  here  for  the  first  time.  It  is  shown 
that  the  lower  is  tighter  than  the  commonly  used  “matched  filter”  or  “no  ISI”  bound. 
A  Kalman-type  decorrelation  algorithm  for  blind  decision  feedback  equalization  is 
also  introduced,  which  has  a  higher  convergence  rate  than  the  original  decorrelation 
algorithm. 

I.  INTRODUCTION 

In  digital  communication  the  need  for  blind  equalization  arises  in  when  the  channel  is 
unknown  to  the  receiver,  which  attempts  to  estimate  a  sequence  of  transmitted  data 
without  resorting  to  the  use  of  a  training  sequence.  The  main  problem  of  blind  equal¬ 
ization  is  that  of  finding  an  appropriate  cost  function  (or  equivalent  error  function) 
that  reflects  the  amount  of  intersymbol  intersymbol  (ISI)  introduced  by  the  channel, 
and  which  does  not  involve  the  transmitted  symbols  [1-5].  Optimization  of  the  cost 
function  should  lead  to  minimization  of  the  ISI.  In  other  words,  optimization  of  such 
a  function  should  be  consistent  with  the  minimization  of  ISI. 

The  first  known  blind  equalization  algorithm  was  introduced  by  Sato  [1].  Sato’s 
blind  equalization  algorithm  error  function  was  later  generalized  by  Benveniste,  et 
al,  [3].  Considering  Sato’s  cost  function,  Godard  [2]  then  described  a  class  of  cost 
functions  that  generalizes  Sato’s.  It  is  also  worth  mentioning  that  the  constant  mod¬ 
ulus  algorithm  (CMA)  developed  separately  by  Treichler,  et  al,  [6-7]  is  also  a  special 
case  of  Godard’s  algorithm. 

Bellini,  et  al  [4],  followed  a  different  approach  and  developed  what  they  termed 
“Bussgang  Techniques.”  Based  on  some  assumptions  about  the  equalizer  and  the 
channel  parameters,  they  derived  a  maximum  likelihood  estimator  of  the  reference 
signal.  This  estimator  depends  on  the  type  of  modulation  used  and  the  signal-to- 
noise  ratio  (SNR).  The  aforementioned  algorithms  are  viewed  as  special  cases  of  the 
Bussgang  technique. 


The  above  algorithms  used  non-convex  cost  functions  which,  in  turn,  imply  the 
existence  of  local  minima  to  which  the  blind  equalizer  might  converge.  Some  of 
these  equilibria  may  be  undesirable,  he.,  at  those  equilibria  the  equalizer  will  not 
be  able  to  remove  ISI.  This  was  shown  and  demonstrated  by  Ding,  et  al,  for  the 
Godard  algorithm  [8-9]  and  for  the  Sato  algorithm  [10].  In  [11],  the  ill  convergence 
of  the  Benveniste,  et  al,  algorithms  [3]  was  also  considered,  thus  proving  that  none 
of  the  previous  algorithms  were  globally  convergent.  For  these  algorithms  equalizer 
initialization  becomes  an  important  issue.  One  would  initialize  the  equalizer  away 
from  the  neighborhood  of  the  ill- convergent  minima. 

Verdu,  et  al,  [12]  developed  a  technique  that  insures  global  convergence  of  blind 
equalizers.  The  key  observation  in  [12]  is  that  over-parameterizing  the  blind  equalizer 
is  the  prime  cause  of  ill- convergence,  and  hence  anchoring  (setting  the  first  coefficient 
to  one)  the  blind  equalizer  was  proposed  [12][13].  This,  together  with  using  a  convex 
function,  guarantees  convergence.  Verdu  used  the  minimized  energy  as  a  cost  func¬ 
tion.  Vembu,  et  al.  [13],  used  the  /i  norm  of  the  equalizer  weight  as  a  cost  function, 
which  was  approximated  by  the  Ip  norm  of  the  equalizer  output.  Kennedy  and  Ding 
[14]  applied  the  concept  of  anchored  equalization  to  a  QAM  transmission.  Due  to 
the  complex  nature  of  the  signal  constellation,  they  performed  joint  equalization  and 
carrier  recovery.  This  was  done  by  anchoring  the  sum  of  the  real  and  imaginary  parts 
of  the  center  tap  to  1,  and  using  the  maximum  of  either  the  real  or  the  imaginary 
part  of  the  equalizer  output  as  a  cost  function.  The  cost  function  was  implemented 
using  the  Ip  norm  of  the  real  or  imaginary  part  of  the  equalizer’s  output. 

Another  family  of  blind  equalization  algorithms  that  appears  in  the  literature 
is  based  on  high-order  moments  and  polyspectra  [15]  and  [16].  In  general,  these 
algorithms  give  better  performance  at  the  expense  of  higher  arithmetic  complexity. 
Basically,  they  use  the  received  samples  to  estimate  the  channel  parameters  and 
reconstruct  the  transmitted  data  via  inverse  filtering.  The  computational  complexity 
of  these  algorithms  makes  them  inappropriate  for  on-line  processing. 

A  different  approach  to  blind  equalization  is  based  on  maximum  likelihood  se¬ 
quence  estimation  [17-19].  In  [17],  the  channel  is  estimated  using  the  least  squares 
method.  In  [18]  an  iterative  procedure  was  devised  which  processes  a  frame  of  re¬ 
ceived  data.  A  least  square  technique  followed  by  a  Viterbi  algorithm  (VA)  is  iterated 
for  jointly  estimating  the  channel  and  data  sequence.  A  similar  iterative  procedure 
is  used  in  [19],  except  the  channel  is  estimated  using  the  EM  algorithm. 

The  main  emphasis  of  blind  equalization  has  been  on  the  linear  equalizer  structure. 
We,  on  the  other  hand,  address  another  structure  which  receives  less  attention:  the 
decision  feedback  equalizer.  In  this  paper  we  introduce  a  new  blind  equalization 
criterion  which  will  be  used  in  conjunction  with  a  decision  feedback  equalizer. 

In  this  paper,  we  assume  that  the  original  data  is  independent  and  identically 
distributed  (iid).  This  is  a  valid  and  widely  used  model.  At  the  output  of  the 
channel,  the  data  is  no  longer  independent.  The  channel  introduces  the  correlation 
in  the  form  of  the  ISI.  We  exploit  the  white  noise-like  characteristics  of  the  original 
signal  and  adapt  the  blind  equalizer  using  decorrelation.  This  was  motivated  by  [20] 
who  provided  a  simple  test  to  show  that  an  adaptive  equalizer  has  converged  to  the 
correct  settings.  In  [20]  it  was  shown  that  if  the  input  data  is  binary  and  iid,  then  the 
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decorrelation  at  the  output  of  the  sheer  of  a  decision  directed  equalizer  is  a  necessary 
and  sufficient  condition  for  the  correct  convergence  of  the  equalizer.  In  this  paper 
we  show  that  the  decorrelation  at  the  input  of  a  sheer  is  a  necessary  and  sufficient 
condition  for  the  perfect  cancellation  of  ISI.  Hence  we  use  decorrelation  as  a  criterion 

for  equalization.  _  j  i  • 

This  paper  is  organized  as  follows.  In  section  2,  we  introduce  the  decorrelation 

algorithm  for  blind  equalization.  In  section  3  we  discuss  the  equalization  of  autore¬ 
gressive  channels  (AR).  In  section  4  the  decorrelation  algorithm  is  used  with  the 
decision  feedback  equalizer  to  equalize  a  channel  of  the  finite  impulse  response  type. 

In  section  5  upper  and  lower  bounds  on  the  probability  of  error  of  the  decision  feed¬ 
back  equalizer  in  the  presence  of  additive  white  Gaussian  noise  (AWGN)  are  given. 

Due  to  the  simplicity  of  the  decorrelation  algorithm  one  could  derive  a  Kalman-type 
decorrelation  blind  equalizer,  which  increases  the  convergence  substantially.  This  is 
presented  in  section  6.  Conclusions  are  given  in  section  7. 

II.  THE  DECORRELATION  ALGORITHM 

Consider  an  Al-tap  adaptive  equalizer,  whose  weights  are  denoted  hy  Wi,  1  <t<N. 

These  weights  are  updated  using  the  steepest  decent  method,  whose  update  equation 
can  be  expressed  as 

-/x/(-)  for  i  (1) 

where  /x  is  the  constant  of  adaptation  and  /(•)  is  the  error  function  of  the  algorithm. 

The  roots  of  the  error  function  determine  the  steady  state  of  the  algorithm. 

In  this  paper  we  propose  a  decorrelation  algorithm  which  uses  AkA^-i  as  an  error 
function,  where  Ak  is  the  output  of  the  equalizer  in  the  linear  equalizer  case  and 
the  input  to  the  sheer  in  the  decision  feedback  equalizer  (DFE)  case.  Therefore  the 
weight  update  equation  can  be  written  as 

=  w-  -I-  iiAkAk-i  for  i  =  1,  •  •  • ,  A^. 

In  a  practical  implementation  one  would  replace  the  expectation  by  the  current 
realization,  leading  to  the  stochastic  difference  equation, 

+  nAkAk-i  for  z  =  1,  •  ■  • ,  Af.  (2) 

The  mean  steady  state  equalizer  weight  is  found  by  setting  the  error  function  m 
equation  (1)  (in  our  case  AkAk-^  for  f  =  1,  •  •  • ,  n)  to  zero.  In  the  following  sections  we 
consider  different  channel  models  and  show  that  the  decorrelation  algorithm  converges 
in  the  mean  to  the  channel  parameters  and  hence  cancels  ISI. 

HI.  EQUALIZATION  OF  AUTOREGRESSIVE  CHAN¬ 
NELS 

In  this  section  we  introduce  the  decorrelation  criterion  for  equalizing  an  AR  channel. 

Despite  the  fact  that  the  AR  model  is  not  commonly  used  as  a  channel  model  for 
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equalization  purpose,  it  gives  a  good  insight  to  the  decorrelation  algorithm.  Although 
our  main  emphasis  in  this  paper  is  the  decision  feedback  equalizer,  we  use  the  linear 
equalizer  in  this  section  to  motivate  the  decorrelation  algorithm  presented  in  the 
previous  section. 


3.1  Problem  Formulation 

Consider  an  AR(ra)  channel  driven  by  an  equi-probable  binary  sequence  {h}-  The 
output  Xk  is  given  by 

n 

=  glk  +  X]  ^i^k-ii  (3) 

i~l 

where  g  is  the  channel  gain  and  a’s  are  the  channel  parameters. 

Figure  1  shows  the  anchored  FIR  blind  equalizer.  In  the  previous  section,  we 
mentioned  the  advantages  of  using  an  anchored  equalizer  [12].  The  output  of  the 
anchored  equalizer  is  given  by 

n 


It 

=  gh  +  Yl  “  '^i)  ■ 

i=l 

The  above  equation  can  be  written  as 

Ak  =  g  (h (4) 

\  i=i  / 

In  order  to  show  that  the  equalizer  converges  in  the  mean  to  channel  parameters,  we 
first  derive  some  correlation  results.  Multiply  equation  (4)  by  Ak-{n+i)  and  take  the 
expectation,  to  get 

AkAk-(n+l)  =  g  (  hAk-(n+l)  —  y^,WiIk-iAk-(n+l)  I  +  aiAk-iAk^(n+i) 

\  i=l  )  i=l 

n 

i=l 

The  last  step  follows  from  the  fact  that  are  independent  for  z  =  0, 1,  2,  •  •  •  ,  n. 

It  further  follows  that 

n 

■Tfc J  ^  \  (y.n—i-\-\AkAk—ii 

2  =  1 
=  0 

since  we  require  that  AkAk-i  =  0  for  z  =  1, 2,  •  •  ■ ,  n  at  the  steady  state.  This  proves 
that  AkAk-i  =  0  for  ^  =  n  +  1.  Similarly,  one  can  prove  that  AkAk-i  =  0  for  i  >  n. 
Therefore,  at  steady  state  we  have 

AkAk-i  =  (T^Sii),  (6) 

where  ^(•)  is  the  Kronecker  delta,  and  we  assumed  that  Ak  is  a  wide  sense  stationary 
random  process.  In  other  words,  the  sequence  {Ak}  becomes  white  noise-like. 
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3.2  Steady  State  Analysis 

In  the  Z-domain,  one  may  write  equation  (4)  as 


a-iz  ^  ■ 

- anZ  "■ 

WiZ~^  —  ■  • 

- WnZ-^ 

A{z)  =  gl{z), 


where  A{z)  and  I{z)  are  the  Z-transforms  of  Ak  and  h,  respectively, 
domain,  using  long  division,  the  above  equation  may  be  written  as 


In  the  time 


CO 

i=0 


(7) 


where 


70  =  1 

71  =  “^1  ~ 

72  =  (^2  -  012)  -  WiJi 

73  =  {^3  -  OC3)  -  W2I1  - 


In  =  {Wn  -  Ctn)  -  ^«n-l7l  ” 


Now  multiply  equation  (7)  by  Ak-i  and  take  expectation 

CO 

'^^iAk-iAk-i  =  0. 

i=0 

The  above  result  follows  since  Ak-i  is  independent  of  Ik.  Hence  together  with  equation 
(6),  we  have 

0-171  =  0; 

therefore,  we  get  71  =  0,  he.,  lOi  =  ai.  Similarly,  one  can  show  that  72  =  0,  which 
together  with  71  =  0,  gives  W2  =  0.2 ■  Hence 

Wi  =  ai  for  z  =  1, 2,  •  •  • ,  n. 

Therefore,  at  steady  state,  the  decorrelation  algorithm  results  in  perfect  ISI  cancel¬ 
lation. 


IV.  EQUALIZATION  OF  MOVING  AVERAGE 
CHANNELS 

In  this  section  we  discuss  the  blind  decision  feedback  equalization  of  channels  with 
a  finite  impulse  response.  We  will  assume  that  the  channel  will  not  introduce  any 
precursors,  and  as  a  result  the  DFE  will  include  a  feedback  filter  and  a  decision 
device.  A  typical  DFE  will  include  a  forward  filter  to  cancel  the  precursor  ISI.  This 
filter  is  typically  a  linear  equalizer  followed  by  a  linear  prediction  error  filter.  Since 
many  articles  deal  with  blind  linear  equalization  we  chose  not  to  address  it.  In  a  real 
scenario  one  could  use  the  decorrelation  algorithm  to  adapt  the  feedback  filter  and 
any  of  the  existing  blind  algorithms  for  the  forward  filter. 
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4.1  Problem  Formulation 


The  channel  and  equalizer  model  under  consideration  is  shown  in  Figure  2.  The 
cascade  of  transmit,  channel  and  receive  filters  is  modeled  as  an  FIR  filter  with 
impulse  response 

N 

h{n)  =  1  +  X]  hi6{n  -  i), 

i=l 

where  6{-)  is  the  Kronecker  delta.  In  the  above  equation  we  normalized  relative  to 
the  first  cursor  {ho).  We  also  assume  that  the  input  h  is  a  binary  white  sequence 
with  a  zero  mean.  The  output  of  the  channel  is  thus  given  by 

N 

2  =  1 

We  assume  that  the  channel  is  slowly  time  varying  and  the  receiver  has  perfect  carrier 
and  timing  recovery.  The  channel  post-cursors  {hi,  ■  ■  ■  ,hi\[]  introduce  ISI  on  the 
current  data  symbol  7^.  The  estimated  data  Ak  is  produced  by  passing  Ak  through  a 
sheer. 

Referring  to  Figure  2,  the  input  to  the  sheer  of  the  decision  feedback  equalizer  Ak 
is  given  by 

Afc  =  Xk-A',_iW 

=  (8) 

where  Ak-i  is  the  vector  of  the  past  N  decisions  A'f._i  —  [Ak-i,Ak-2,-''iAk-N] 
(the  prime  stands  for  transpose)  and  b-i  is  the  vector  of  past  transmitted  infor¬ 
mation  bits  l'f._i  -  [7fc_i,/fc_2,---,7fc-iv],  where  h-i  e  {-1,1}  and  P{Ik-i  =  1}  = 
P{Ik-i  =  -1}  =  |.  W  and  H  are  the  equalizer  and  channel  parameter  vectors, 
respectively ;W'  =  [wi,W2,  •  ■  ■ ,  ruAf]  and  H'  =  [/ii,  /i2,  ■  ■  • ,  ^w]- 

In  this  paper  we  will  assume  a  noiseless  situation,  be.,  we  consider  an  arbitrary 
high  signal-to-noise  ratio.  AWGN  will  be  considered  in  the  next  section  For  ideal 
ISI  cancellation,  the  sheer’s  input  Ak  =  Ik  and  therefore  sequence  {Ak}  will  be 
decorrelated,  be.,  AkAk-n  =  0  for  n  /  0.  In  other  words,  decorrelation  is  a  necessary 
condition  for  ideal  cancellation  of  ISI.  In  order  to  be  able  to  use  the  decorrelation  of 
the  sheer’s  input  as  a  criterion  for  controlling  the  feedback  weight  vector  W,  we  must 
prove  that  decorrelation  is  also  sufficient  for  cancelling  ISI.  This  is  what  we  intend  to 
show  in  the  next  section. 


4.2  Sufficiency 

In  order  to  prove  sufficiency,  we  rewrite  equation  (8)  as 

N  N 

Ak  —  Ik  P  ^  )  hilk—i  ^  )  f^iAk—i  •  (9) 

2=1  2=1 
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If  we  denote  the  set  of  all  correct  decisions  by  Ai  and  the  set  of  all  incorrect  decisions 
by  A2,  he., 

Ai  =  {Ai  :  Ai  =  li} 

A2  =  {Ai  :  Ai  =  — /i}, 

then  equation  (9)  can  be  written  as 

Ah  =  hfc  +  ^2  ~  Wi)Ik-i  +  ^  {hi  +  Wi)Ik-i 

i:Ak-ieAi  i-.Ak-ieA^ 

=  h  +  j^%h-i  (10) 

=  h  +  isik, 

where  7i  is  given  by 


7i  = 


{hi  —Wi)  for  all  i  :  Ak-i  G  Ai 
{hi  +  Wi)  for  all  i  :  Ak-i  G  A2, 


and 


N 

isik  = 

t  =  l 


We  can  now  show  that  decorrelation  is  a  sufficient  condition  for  cancelling  ISI.  Mul¬ 
tiply  equation  (10)  by  Ak-i  =  [Afc_i,  Ak-21 "  ‘  1  A/j-v],  the  vector  of  the  past  sheer  s 
input,  to  obtain 


N 

AkAk-i  =  IkAk-i  +  y^,^ih-iAk-i- 

i=l 


By  taking  the  expectation  on  both  sides  of  the  above  equation,  it  can  be  shown  that 
the  above  equation  reduces  to 


AfcAfc-i 

AkAk-2 

AkAk—n 

AkAk-N+1 

AkAk-N 


71  +  7i7i+i 

72  +  7»7i+2 

7n.  A  Ei=l  lili+n 

7iV-l  +  7l7N 
IN 


(11) 


It  is  clear  from  the  last  entry  of  the  vectors  in  the  above  equation  that  7Ar  =  0 
iff  AkAk-N  =  0.  Similarly,  it  follows  from  the  {N  —  l)th  entry  that  if  -jn  =  0,  then 
7a^_i  =  0  iff  AkAk-NAi  —  0-  One  would  thus  start  from  the  bottom  entry  and  use 
back  substitution  to  show  that  7^  =  0  for  z  =  1,  ■  •  • ,  AI  iff  AkAk-i  =  0  for  z  =  1,  •  •  • ,  N . 
It  thus  follows  from  equation  (10)  that  isik  =  0  in  the  steady  state  iff  AkAk-i  =  0 
for  z  =  1,  -  ■  • ,  AI.  This  completes  the  proof  that  decorrelation  is  also  sufficient  for 
cancelling  ISI. 
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4.3  Simulation  Results 

A  non-minimum  phase  channel,  whose  transfer  function  is  given  by 

=  1  -b  0.8^“^  -f  0.4z'^  -  0.6^-^  +  0.2z-\ 

was  used  in  a  simulation  to  demonstrate  the  convergence  of  the  decorrelation  al¬ 
gorithm.  Figure  3  shows  the  learning  curve  obtained  by  ensemble-averaging  over 
200  independent  simulation  runs.  The  blind  DFE  used  had  four  tap  gains  in  the 
feedback  filter.  The  adaptation  constant  fi  used  in  the  simulation  was  set  to  0.01, 
and  the  weights  were  initialized  to  zero.  As  depicted  in  Figure  3,  the  four  weights 
rci,  i  =  1,  2, 3, 4  converge  to  the  correct  channel  parameters. 

The  decorrelation  algorithm  was  also  used  to  equalize  a  non-minimum  phase  chan¬ 
nel  whose  transfer  function  is  given  by 

H{z-^)  =  1  +  0.5z-^  -  1.44Z-T 

In  order  the  demonstrate  the  ability  of  the  decorrelation  algorithm  to  converge  to 
the  correct  weight  irrespective  of  the  initial  condition,  we  initialized  the  equalizer 
weights  with  different  values.  Figure  4  portrays  the  trajectories  for  different  equalizer 
initializations.  It  clearly  shows  that  the  decorrelation  algorithm  always  converges  to 
the  correct  weights. 

V.  ERROR  ANALYSIS 

In  considering  the  probability  of  bit  error  for  the  AWGN  case  with  a  zero  mean  and 
a  variance  of  (T^,  we  follow  the  Duttweiler,  et  al,  approach  [21].  Only  results  will  be 
given  in  this  section,  readers  interested  in  details  are  referred  to  [21]  [22].  An  upper 
bound  to  the  probability  of  error  was  derived  in  [21].  Duttweiler,  et  al,  modeled 
the  errors  in  the  feedback  filter  of  a  DFE  as  a  finite  state  machine.  Using  the  state 
transition  probabilities  (a^,  i  =  1,  -  ■  ■  ,N  where  in  our  case  N  is  the  number  of 
delay  elements  in  the  feedback  filter),  Duttweiler  obtained  an  upper  bound  to  the 
probability  of  error.  Using  the  same  finite  state  machine,  a  lower  bound  was  derived 
in  [22]  which  was  tighter  than  the  commonly  used  “no  ISI”  bound. 

It  was  shown  in  [21]  that  the  probability  of  error  q  is  given  by 

q  =  Rj^ , 


where 

N-2  i  N-1 

rn  =  1 + X]  n  n (^2) 

1=0  m=0  m=0 


In  order  to  calculate  the  probability  of  error,  one  has  to  know  all  the  state  transition 
probabilities,  a,-.  This  is  not  feasible  for  large  N.  Instead,  one  can  use  lower  bounds 
on  the  transition  probability  to  derive  an  upper  bound  on  the  probability  of  error. 
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In  [22]  we  derived  expressions  for  the  lower  and  upper  bounds  on  the  transition 
probabilities,  then  obtained  respectively  an  upper  and  lower  bound  on  the  probability 
of  error  of  the  blind  decision  feedback  equalizer. 

The  bounds  on  the  transition  probabilities  are  given  below  (details  of  the  deriva¬ 
tion  are  omitted). 


«  >  ^  ^  —  1  —  2hjn+l  +  ^m+l  ^  Q  ^~1  +  2hm+l  +  l^m+1 

for  0  <  m  <  -  2 

1  ( ^  (  1  2/im+l  l^m+1  \  ^  2/im+l  ~  /^m+l 

““  -  2  ; - 


ajv-i 


for  0  <  m  <  N  —  2 
-1  —  2/i;v'\  Q  /  — 1  +  2/?jv 


OiN  =  Q 


where 


l^m=2  \^i\- 

i-=m+l 

Using  these  results  in  equation  (12),  we  can  obtain  the  lower  and  upper  bounds  on 
the  probability  of  error  in  the  steady  state. 

As  an  example,  consider  the  channel  whose  transfer  function  is  given  by 

=  1  -b  0.8^-^  +  0.6z-2  +  0.4^-^ 

In  this  case,  the  feedback  section  contains  three  delay  elements,  N  =  3.  The  transition 
probabilities  are  given  by 

02  =  ef-U. 
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where  (3i  —  2(|/i2|  +  l/isi)  and  ^2  =  2|/j3|.  Figure  5,  below,  shows  the  lower  and 
the  upper  bound  of  the  probability  of  error.  Also  shown  is  the  probability  of  error 
computed  from  simulation.  The  “no  ISI”  lower  bound  is  also  shown,  and  one  can  see 
that  the  lower  bound  derived  here  is  tighter  than  the  “no  ISI”  bound. 


VI.  KALMAN- TYPE  DECORRELATION  ALGO¬ 
RITHM  FOR  THE  BLIND  DEE 

A  drawback  of  blind  equalizer  is  the  speed  of  convergence.  It  can  take  several  hundred 
to  several  thousand  symbols  for  the  blind  equalizer  to  converge.  The  speed  of  conver¬ 
gence  of  the  conventional  LMS  equalizer  was  improved  by  using  a  weighted  sum  of 
the  past  squared  errors  [23].  The  resulting  algorithm,  known  as  the  Recursive  Least 
Squares  (RLS)  or  Kalman  algorithm,  improves  the  speed  of  convergence  substantially. 
The  penalty  is  an  increase  in  computational  complexity. 

The  decorrelation  algorithm  described  in  this  paper  uses  a  simple  error  function, 
which  makes  it  easy  to  extend  to  a  Kalman-type  algorithm.  In  this  section  we  show 
how  one  can  improve  the  convergence  of  the  algorithm  by  considering  a  weighted  sum 
of  past  correlations.  In  other  words,  instead  of  decorrelating  instantaneous  realiza¬ 
tions  of  the  output,  we  decorrelate  the  time- average  weighted  samples  of  the  output, 
be.,  we  force 

1=0 

to  zero,  where  A'j  =  [Aj_i,  Aj_2,  •  •  • ,  Aj_„].  Substituting  for  Ak  from  equation  (8) 
and  setting  the  weighted  correlation  time  average  to  zero,  we  get 

•£  X'-’A,  (X,  -  A' Wt)  =  0, 

i=o 

where  we  have  introduced  the  time  index  k  in  the  weight  vector,  since  a  weight  update 
equation  will  be  derived  from  the  above  expression.  The  above  equation  leads  to 

Wfc  =  R-\Dfc,  (13) 

where 

Rt  ^  i:a‘-'a,a' 

J.=0 

and 

i=o 
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6.1  The  Recursive  Matrix  Inversion 

Equation  (13)  involves  the  inversion  of  an  n  x  n  matrix,  Rfc,  and  the  Kalman  formu¬ 
lation  involves  a  recursion  formula  for  the  evaluation  of  the  inverse  matrix.  A  similar 
one  can  be  used  here. 

It  is  important  to  note  that  matrix  Rfc  can  be  obtained  recursively  as 

Rk  =  \Rk-i  +  AkK  (14) 


It  is  known  that  for  any  A  nonsingular  matrix,  and  u  and  v  the  following  is  true: 


/X  1  ^  A  ^uv'A  ^ 

(A  +  uv )  -  A  1  ,  /A-i  • 

^  ^  1  +  v'A 

(15) 

Therefore,  using  equation  (15)  in  equation  (14),  we  get  a  recursive 

formula  for  R-^r: 

„_1  1  /-rj-l  ^  '^k-lAkA'i.R  ^k-l\ 

a1^^  \  +  A[R-h-iAk  )' 

(16) 

Next  define 

Pk  = 

(U) 

and  further  define  the  Kalman  vector  gain  as 

kfc  —  ,  Pfc-iAfc, 

A  +  /ifc 

(18) 

where  the  scalar  Hk  is  given  by 

flk  =  A'^R“^A:-1  Afc. 

Using  the  above  definitions,  one  can  write  equation  (16)  as 

Pt  =  1  (Pi-i  -  kiA'jPn)  .  (19) 

The  vector  D);  can  also  be  obtained  recursively  as 

Dfc  =  ADfc_i  +  XkAk-  (20) 

Using  equations  (13)  and  (17),  we  can  write 

Wfc  =  PfcDfc. 


Therefore,  using  the  recursive  formulae  for  P*,  and  Dfc  from  equations  (19)  and  (20) 
respectively,  we  get 


Wfc 


—  (Pk-i  —  kfcA'^Pfc-i)  (ADfc_i  +  XkAk) 

Pfc-iDfc-i  T  'rXkPk-iAk 

A 
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1 

k/j  A^P/;;_X  I^A:  — 1  ^  Ax;kx;  AX: 

=  Wjt_i  + 

— kfcA'x.Wi_x  —  —Xkfik^k 

=  W,_i  +  kfc  (Afc  -  Aj,Wfc_x) 

=  Wjt-x  +  ^’fckfc,  (21) 


where  is  given  by 

=  {Xk  -  AlWfc_x)  . 

The  order  that  constitutes  the  Kalman  type  decorrelation  algorithm  is  summarized 
below: 

W*:  =  Wfc_x  +  Zk\<.k, 


where 

=  {Xk  -  A;Wfc_x)  . 

The  vector  AT  is  evaluated  by  the  recursions: 


kA;  =  -p- - Pi-lAfc 

^  +  Hk 


where 


fJ-k  = 


k—i  kx;Ayj,P/j_x^  , 

A'x-P/t-x  Afc. 


6.2  Simulation  Results 

A  channel  whose  transfer  function  given  by 

=  1  +  0.5z“i  -  1.44^-^ 

was  used  to  demonstrate  the  improvement  in  the  convergence  rate  when  using  the 
Kalman  approach.  Figure  6  depicts  the  learning  curve  of  the  decorrelation  DFE 
and  the  Kalman-type  decorrelation  DFE,  obtained  by  ensemble- averaging  over  100 
independent  simulation  runs.  Eigure  7  depicts  the  averaged  squared  error  of  the 
decorrelating  DFE  and  the  Kalman-type  decorrelating  DFE.  The  increased  speed  of 
the  latter  is  clearly  shown. 


VII.  CONCLUSIONS 

In  this  paper  we  introduced  a  new  criterion  and  algorithm  for  blind  equalization. 
This  algorithm  is  used  in  conjunction  with  a  decision  feedback  equalizer.  It  decorre¬ 
lates  the  data  sequence  at  the  input  of  the  sheer.  It  was  shown  to  converge  to  the 
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optimum  weights  irrespective  of  the  initial  error  rate.  Examples  for  minimum  and 
non-minimum  phase  channels  supplement  the  proven  analytical  results. 

With  an  adaptation  constant  /i  of  0.01  the  algorithm  converges  after  300  itera¬ 
tions.  The  simulation  also  shows  that  the  algorithm  converges  to  the  optimum  point 
regardless  of  the  initial  setting. 

We  also  used  the  error  model  proposed  by  Duttweiler,  et  a/.,  to  obtain  upper  and 
lower  bounds  on  the  steady  state  probability  of  error  for  a  decision  feedback  equalizer. 
The  upper  bound  is  essentially  the  same  as  that  obtained  by  Duttweiler.  The  lower 
bound  is  tighter  than  the  commonly  used  “no  ISI”  lower  bound. 

We  have  also  derived  a  Kalman-type  decorrelation  algorithm  for  the  blind  DEE. 
This  has  a  higher  rate  of  convergence  than  the  original  decorrelation  algorithm,  with 
the  added  cost  of  an  increase  in  the  number  of  computations. 
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Figure  1:  Blind  Linear  Equalizer 
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Weights 


Adaptive  Control 


Figure  2:  Decision  Feedback  Equalizer  with  Decorrelation  Control 


Figure  3:  Learning  Curve  of  the  BDFE 


Probability  or  bit  error 


Figure  5:  Probability  of  Error 


Figure  6:  Learning  Curves  of  the  Decorrelation  DFE  and  Kalman-type  Decorrelation 
DFE 


Eigure  7:  Residual  ISI  of  the  Decorrelating  DEE  and  Kalman-type  Decorrelation  DFE 
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APPENDIX  C:  PART  II 


ANCHORED  BLIND  EQUALIZATION  USING  THE  CONSTANT 

MODULUS  ALGORITHM 

by 

Raafat  Kamel  and  Yeheskel  Bar-Ness 

ABSTRACT 

Blind  equalization  is  a  technique  of  adapting  an  equalizer  without  the  need  of  a  training 
sequence.  The  constant  modulus  algorithm  (CMA)  is  one  of  the  first  known  blind  equaliza¬ 
tion  algorithms.  The  cost  function  of  the  CMA  exhibits  local  minima,  which  are  the  primary 
cause  of  the  ill-convergence  of  the  CMA.  Anchoring  the  CMA  improves  the  performance  of 
the  CMA  in  terms  of  ill-  convergence.  This  technique  is  used  with  the  linear  and  the  decision 
feedback  equalizers.  It  is  shown  that  the  adaptive  equalizer  will  always  remove  intersymbol 
interference  (ISI)  as  long  as  the  channel  gain  exceeds  a  certain  critical  value. 

I.  INTRODUCTION 

Adaptive  blind  channel  equalizers  eliminate  the  use  of  training  signals  in  digital  communi¬ 
cation  when  their  transmission  is  impractical.  The  problem  of  adaptive  blind  equalization 
is  that  of  finding  an  appropriate  cost  function  (or  equivalent  error  function)  that  reflects 
the  amount  of  intersymbol  intersymbol  (ISI)  introduced  by  the  channel,  and  which  does 
not  involve  the  transmitted  symbols  [1-5].  Optimization  of  the  cost  function  should  lead  to 
minimization  of  the  ISI.  Blind  equalization  algorithms  differ  in  the  choice  of  the  cost  function. 

The  first  known  blind  equalization  algorithm  was  introduced  by  Sato  [1].  Sato’s  blind 
equalization  algorithm  error  function  was  later  generalized  by  Benveniste,  et  al,  [2].  Consid¬ 
ering  Sato’s  cost  function,  Godard  [3]  then  described  a  class  of  cost  functions  that  generalizes 
Sato’s.  A  similar  scheme,  known  as  the  constant  modulus  algorithm  (CMA)  was  developed 
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separately  by  Treichler,  et  al.,  [6-7]  based  on  property  restoral  concept. 

Bellini,  et  al.  [4],  followed  a  different  approach  and  developed  what  they  termed  “Buss- 
gang  Techniques.”  Based  on  some  assumptions  about  the  equalizer  and  the  channel  param¬ 
eters,  they  derived  a  maximum  likelihood  estimator  of  the  reference  signal.  This  estimator 
depends  on  the  type  of  modulation  used  and  the  signal-to-noise  ratio  (SNR).  The  aforemen¬ 
tioned  algorithms  are  viewed  as  special  cases  of  the  Bussgang  technique. 

The  above  algorithms  used  non-convex  cost  functions  which  possess  local  minima  to 
which  the  blind  equalizer  might  converge.  Some  of  these  equilibria  may  be  undesirable,  i.e., 
at  those  equilibria  the  equalizer  will  not  be  able  to  remove  ISI.  This  was  demonstrated  by 
Ding,  et  al,  for  the  Godard  algorithm  [9]  and  for  the  Sato  algorithm  [8].  In  [10],  the  ill 
convergence  of  the  Benveniste,  et  a/.,  algorithms  [2]  was  also  considered,  thus  proving  that 
none  of  the  previous  algorithms  were  globally  convergent.  For  these  algorithms  equalizer 
initialization  becomes  an  important  issue.  One  would  initialize  the  equalizer  away  from  the 
neighborhood  of  the  ill- convergent  minima. 

Verdu,  et  a/.,  [11]  developed  a  technique  that  insures  global  convergence  of  blind  equaliz¬ 
ers.  The  key  observation  in  [11]  is  that  over-parameterizing  the  blind  equalizer  is  the  prime 
cause  of  ill-convergence,  and  hence  anchoring  (setting  the  first  coefficient  to  one)  the  blind 
equalizer  was  proposed  [11-12].  This,  together  with  using  a  convex  function,  guarantees  con¬ 
vergence.  Verdu  used  the  minimized  energy  as  a  cost  function.  Vembu,  et  al  [12],  used  the 
h  norm  of  the  equalizer  weight  as  a  cost  function,  which  was  approximated  by  the  Ip  norm  of 
the  equalizer  output.  Kennedy  and  Ding  [13]  applied  the  concept  of  anchored  equalization 
to  a  QAM  transmission.  The  cost  function  used  was  the  Ip  norm  of  the  real  or  imaginary 
part  of  the  equalizer’s  output. 
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Another  family  of  blind  equalization  algorithms  that  appears  in  the  literature  is  based  on 
high-order  moments  and  polyspectra  [14]  and  [15].  In  general,  these  algorithms  give  better 
performance  at  the  expense  of  higher  arithmetic  complexity.  Basically,  they  use  the  received 
samples  to  estimate  the  channel  parameters  and  reconstruct  the  transmitted  data  via  inverse 
filtering.  The  computational  complexity  of  these  algorithms  makes  them  inappropriate  for 
on-line  processing. 

Maximum  likelihood  blind  sequence  estimation  has  also  been  proposed  [16-19].  These 
approaches  involves  the  use  of  a  blind  channel  identification  scheme  is  conjunction  with  the 
Viterbi  Algorithm. 

A  new  approach  to  blind  channel  identification  was  reported  in  [20-21],  that  exploits  the 
cyclostationery  nature  of  digital  signals.  Interested  readers  are  refered  to  [20-21]. 

In  this  paper  we  will  consider  using  the  CMA  with  an  anchored  equalizer.  This  approach 
will  improve  the  performance  of  the  convergence  property  of  the  original  CMA.  It  is  shown 
that  the  anchored  blind  equalizer  with  the  CMA  (ACMA)  converges  to  the  channel  param¬ 
eters  rendering  zero  ISI  provided  the  channel  gain  exceeds  a  certain  critical  value.  If  the 
gain  drops  below  this  critical  point,  the  algorithm  will  converge  to  a  local  minimum.  This 
problem  can  be  alleviated  by  introducing  a  gain  in  the  equalizer.  The  speed  of  convergence 
of  this  equalizer  will  be  compared  to  that  of  AMEA  [11]. 

This  paper  is  organized  as  follows.  First  we  consider  using  the  ACMA  for  blind  equal¬ 
ization  of  autoregressive  and  moving  average  type  channels  in  sections  2  and  3,  respectively. 
In  section  4,  we  present  simulation  results.  Conclusions  are  given  in  section  5. 
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II.  EQUALIZATION  OF  AUTOREGRESSIVE  CHANNELS 


Consider  an  AR  channel  of  order  n  (Ai?(n)),  the  received  signal  is  given  by 

n 

Tk  -  ga-k  R^OLiTk-i: 

i=l 

where  g  is  an  arbitrary  gain,  and  ftj’s  are  the  AR{n)  channel  parameters.  The  information 
symbols  {aRs)  are  binary,  independent  and  identically  distributed  with  zero  mean  and  unit 
variance.  The  moving  average  (MA)  anchored  equalizer  output  has  its  first  tap  set  to  1,  and, 
therefore,  its  output  is  given  by 

n 

Vk  =  rk  +  Yl  WiVk-i 

n 

=  g^k  +  Y^  {o!i  +  Wi)  Tk-i 

i=\ 

=  gak  +  isik  (1) 

where  rc,’s  are  the  equalizer’s  coefficients  and  isik  is  given  by 

n 

isik  =  Y  (2) 

2  =  1 

The  CMA  exploits  the  fact  that  the  original  constellation  has  a  constant  envelope,  that  is, 
if{|ai|}  =  1  for  all  k.  Therefore  an  appropriate  cost  function  would  be 

J{yk)=  (}yk? -lY ,  (3) 

which  is  minimized  when  the  equalizer  output  has  a  constant  modulus  {E{\yk\}  =  1).  Using 
stochastic  gradient  descent  to  minimize  the  above  cost  function,  the  update  equation  for  the 
CMA  is  given  by 

=  w-  -  grk-i  [yl  -l)yk  for  z  =  1,  •  •  • , n.  (4) 

Figure  1  shows  the  anchored  linear  equalizer  and  the  CMA  control. 


Multiply  the  above  equation  by  a*-  +  w^: 

u;f+^  (ai  +  w^)  =  (ai  +  inf)  -  /i  (a,-  +  mf)  (y^  -  l)  . 

Now  take  the  expectation  of  the  above  conditioned  on  inf; 

a,E{inf+'linf}  +  inf£;{in'f+' |inf }  =  a, inf  +  (inf)'  -  //  (a,-  +  inf)  (y^  _  l)  |u;f}. 

Steady  state  is  reached  when  E{inf+^|inf}  =  inf,  and,  therefore,  we  have 

(oi  +  inf)  £{rfc_iyfc  {yl  -  l)  |inf }  =  0  for  i  =  1,  •  •  • ,  n  and  all  k.  (5) 

The  above  n  equations  determine  the  points  of  equilibrium  of  the  algorithm.  One  would  have 
to  solve  the  above  equations  in  order  to  determine  whether  the  algorithm  would  converge  to 
the  desired  values  (in,-  =  ai)  and,  hence,  cancel  the  ISI  completely.  Instead  we  show  directly 
that,  under  certain  conditions  for  the  gain  y,  equation  (5)  implies  complete  cancellation  of 
ISI. 

Adding  the  above  n  equations  we  get 

(«*  +  ^f)  E{rk-iyk  (yl  -  l)  \wi}  =  0. 

?=1 

Substituting  equations  (1)  and  (2)  in  the  above  equation  results  in 

E{isik  ((yufc  +  isik)^  -  l)  (yajt  +  isik)  \wi  } 

=  E{isik  -  l)  isik  +  {^isil  -  l)  yofc  +  isil)  linf}} 

=  0. 

Now,  with  the  definition  (2),  isik  depends  on  the  previous  data  and  inf  (which  itself  depends 
only  on  the  previous  data  ak-i,i  >  1),  therefore  isik  is  independent  of  the  current  data  a^. 
Using  this  together  with  the  fact  that  both  have  a  zero  mean,  we  get 

E  l^{yl-l)  ykf2{ai  +  Wi)rk-i\wi^  =  E  \isisk{yl  -  l)yk\w'l} 
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=  -  l)  E{isil\w^^]  +  E{isi\\w^^] 

=  (3/  -  1)  E{isil\w^^]  +  E{isi\\w^^}  (6) 


=  0. 


If  “ig^  —  1  in  equation  (6)  is  a  positive  quantity  then  it  can  be  written  as 

E{isil\w’l]  =  ~K^E{isil\w^^]  (7) 

with  K  positive.  However,  both  E{isiWw^'\  and  E{isil\w'-}  being  positive  quantities  implies 

E{isil\w'^}  =  E{isil\w'^}  =  0, 

and  together  with  the  fact  that  the  expected  value  of  isi^  is  0,  we  conclude  that  isik  =  0 
with  a  probability  of  1.  In  sum,  if  the  algorithm  reaches  a  steady  state  then  equation  (5)  is 
satisfied  for  i  =  1,  -  •  ■  ,n,  and  from  equation  (1) 


—  9^k 


for  all  k. 


If,  however,  —  1  is  negative  then  equation  (6)  can  be  written  as 


E{isil\w^}  =  A.^E{tsil\w^}, 


where  =  1  —  3g^,  in  which  case  the  ISI  power  is  not  necessarily  zero.  This  corresponds 
to  a  case  wherein  the  algorithm  converges  to  a  local  minimum,  which  could  be  undesirable. 
We  show  the  existence  of  such  undesirable  equilibria  by  using  a  simple  example. 


2.1  Undesirable  Equilibria 


Consider  an  ARin)  channel  with  one  feedback  tap,  given  by 


d^k  T  OL^k—n- 
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The  equalizer  output  is  then  given  by 


and  the  ISI  term  by, 


gak  +  (tn  +  a)  — n? 


isik  =  (r«  + 


It  is  then  easy  to  show  that 


E{rt}  _  ,2  1  +  5^' 

E{rl]  ^  l-a-’ 


and  by  substituting  equation  (10), 


E{isit\w}  _  o. 


E{isil\w 


Combining  equation  (11)  with  equation  (8),  we  get 


(w  +  a)^  =  A" 


1  —  a* 


g'^{l  +  5q!^) 


=  (1-3,^) 


1  —  Of'* 


g‘^{l  +  5q:^) 


w  =  —a  ± 


/I  -  3g'^  1  - 


g  \|(l  +  5a2)-  ^  ^ 

This  clearly  shows  that  the  weight  w  will  not  converge  to  the  correct  channel  parameter  a. 

In  particular,  following  a  procedure  similar  to  that  of  [9],  we  predict  the  condition  on  g 
and  d  which  results  in  u;  =  0,  and  in  no  ISI  cancellation  (see  equation  (9)).  Setting  to  —  0, 
in  equation  (13),  we  get 

2  _  1  - 
^  3  +  a2  +  20-^  ■ 

Therefore  if  the  gain  satisfies  the  above  equation,  the  equalizer  will  not  remove  ISI.  In 
conclusion,  if  the  condition  3c/2  -  1  >  0  is  guaranteed,  one  would  ensure  that  the  algorithm 
would  always  converge  to  the  correct  channel  parameters.  In  other  words  should  the  channel, 
gain  g  be  less  than  the  algorithm  will  not  converge  to  the  no  ISI  case. 
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The  actual  dependence  of  steady  state  and  ill  convergence  of  the  ACMA  on  the  channel 
gain  g  is  examined  in  the  following  example.  For  the  AR{1) 


rk  -  gak  +  0.6rfc_i. 

The  ACMA  equalizer  is  given  by 


(14) 


yk  =  rk  +  w^^Kk-i. 


(15) 


Using  these  two  equations,  we  plot  in  Figure  2  the  cost  function  J{yk)  =  E{{yl  -  1)^}  as  a 
function  of  w  and  g.  From  this  figure  it  is  clear  that  if  the  gain  ^  ^  =  0.577  then  the 

cost  function  has  a  unique  minimum  ai  w  =  a  =  0.6.  If,  however,  the  gain  g  drops  below 
the  function  will  have  two  minima  and  a  maximum  at  w  =  a  and  therefore  the  equalizer 
will  not  converge  to  the  channel  parameter. 

One  can  alleviate  this  problem  by  introducing  an  arbitrary  gain,  G,  in  the  equalizer.  The 
output  of  the  equalizer  is  then  given  by 

n 

Vk  =  Grk  +  G  ^  WiTk-i 
i~\ 
n 

=  Ggak  +  G  (a,-  +  Wi) .  (16) 

i=l 

Following  a  similar  procedure  as  the  one  above,  one  can  show  that  at  the  algorithm’s  steady 
state,  ISI  is  cancelled  (with  a  probability  of  1)  if  and  only  if 


9  > 


1 

G\/3' 


(17) 


Thus,  one  can  choose  G  appropriately  such  that  condition  (17)  is  satisfied.  In  other  words, 
one  would  choose  G  such  that  for  the  worst  case  g  condition  (17)  is  met.  If  the  worst  case 
is  0.01,  for  example,  one  would  choose  G  > 
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III.  EQUALIZATION  OF  MOVING  AVERAGE  TYPE  CHANNELS 


Consider  a  real  MA  type  of  channel  of  order  ra,  MA(n),  the  received  signal,  is  given  by 

n 

rk  =  go-k  +  Yl 

i=l 

where  g  is  an  arbitrary  gain  of  the  first  cursor  and  /ij-’s  are  the  MA(n)  channel  parameters. 
The  input  to  the  sheer  of  the  decision  feedback  equalizer  is  given  by 

n 

Vk  =  fk-Yj  WiVk-i 
1  =  1 

n  n 

=  90‘k-\-Y  ^i^k-i  -  Y  ^iVk-ii  (1^) 

t=:l  i-\ 

where  tCj’s  are  the  equalizer’s  coefficients.  Figure  3  shows  the  decision  feedback  equalization, 
with  the  control  section. 

Now,  if  we  denote  the  set  of  all  correct  decisions  by  Yi  and  the  set  of  all  incorrect  decisions 
by  Y",  i.e., 

y  =  {vi  ■  Vi  = 

Y  =  {yi  :  yi  = 


then  equation  (18)  can  be  written  as 

Vk  =  90,k  +  Yj  ~  '<^i)yk-i  —  Y^  )yk-i 

i-.yk-if-Y'  i:yk-ieY" 

n 

=  9(^k  +  Y  liVk-i 
1  =  1 


(19) 


where 

and  the  qhs  are  given  by 

li 


"^Slk  —  'y  y  lyk—i  •) 
i=l 

{hi  -  Wi)  for  i  :  yk-i^Yi 

-{hi  +  wg)  for  i  :  yk-ieY2. 
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Applying  the  constant  modulus  cost  function  in  equation  (3)  for  the  DFE,  and  using 
equation  (18)  for  dykjdwi^  we  get  the  following  update  equation 

+  fJ'h-i  {yl  -  l)  ?/fc  for  z  =  1, 2,  •  •  • ,  n.  (20) 

Multiplying  equation  (20)  by 

(yl  -l)yk-  (21) 

As  before,  take  expectation  conditioned  on  wf  to  obtain 

=  u;fE{7i|rof}  +  yE{^,yk-^  [yl  -  l)  yk\w-}.  (22) 

It  is  simple  to  show  that  steady  state  is  reached  (that  is  E{w^'^^\wf}  =  wf)  if  and  only  if 
E{^iyk-i  {yl  -  l)  yk}  =  0  for  all  i  for  which  yk-tcYi  or  yk-itY2. 

Summing  on  i  we  have 

E  {yl  -  l)  =  E{isik  {yl  -  l)  yk\w^i] 

=  0. 

This  is  exactly  the  same  equation  we  have  for  the  AR(n)  channel  case.  In  Figure  4.3  we 
present  a  digital  implementation  of  the  DFE-ACMA  for  an  MA  type  channel. 

IV.  SIMULATION 

In  this  section  we  present  simulation  results  of  the  anchored  constant  modulus  algorithm 
as  it  is  applied  to  the  linear  equalization  for  autoregressive  channels  and  to  the  decision 
feedback  equalization  of  moving  average  type  channels.  In  particular,  we  show  the  effect  of 
the  gain  g  on  the  performance. 


4.1  Linear  Equalization 


Consider  the  AR{1)  channel  whose  output  signal  is  given  by 

rk  —  gak  +  0.6rfc_i 

The  linear  equalizer  taps  weights  are  updates  using 

-  firk-iVk  (yl  -  l)  • 

The  averaged  squared  error  of  this  equalizer  is  given  in  Figure  4 

4.2  Decision  Feedback  Equalization 

To  examine  the  performance  of  the  DFE  we  consider  a  channel  whose  transfer  function 
is  given  by 

H{z-'^)=g  +  0.5z-^ -l.Uz-^  (23) 

with  the  corresponding  adaptation  rule  of  equation  (20).  The  averaged  squared  error  is 
depicted  in  Figure  5. 

In  these  figures,  the  estimate  of  the  residual  ISI  power  is  obtained  by  passing  the  sequence 
of  the  squared  error((afc  -  sgn  {vk))^)  through  a  smoothing  filter  whose  transfer  function 
is  given  by  0.05/ (1  -0.95z~^).  These  figures  show  clearly  that  the  speed  of  convergence 
of  the  ACM  A,  for  the  linear  and  decision  feedback  equalizers,  depends  on  the  channel  gain 
g.  As  the  gain  g  the  approaches  breakpoint,  g  =  the  algorithm  takes  a  longer  time 
to  converge.  We  notice  that  for  1  <  <  0.7,  the  speed  of  convergence  is  nearly  constant, 

reaching  approximately  zero  after  130  iterations,  while  for  g  =  0.6  the  algorithm  converges 
after  250  iterations.  The  ill  convergence  of  the  algorithm  is  also  evident  for  the  gain  of 
5  =  0.5<^. 
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The  performance  of  the  ACMA  is  also  compared  with  the  anchored  minimum  energy 
algorithm  described  in  [11].  In  Figure  6,  we  depict  the  convergence  of  this  algorithm  when 
used  with  the  AR{1)  channel  used  in  equation  (9).  The  adaptation  rule  for  the  AMEA  is 
given  by 

+  nykrk-i 

As  in  ACMA,  the  speed  of  convergence  depends  on  the  channel  gain  g.  However,  being 
globally  convergent,  the  AMEA  shows  no  ill  convergence  for  a  small  g. 

Finally,  in  Figure  7  we  compare  the  rate  of  convergence  of  the  ACMA  with  that  of  the 
AMEA  for  g  =  1.0.  The  ACMA  converges  faster  than  the  AMEA. 

V.  CONCLUSION 

In  this  paper  we  used  the  concept  of  anchoring  the  blind  equalizer  with  the  constant 
modulus  algorithm  for  AR  and  MA  channels.  We  showed  analytically  and  through  simula¬ 
tion  that  the  algorithm  converges  successfully  if  the  unknown  channel  gain  exceeds  a  certain 
value  (:^)-  The  algorithm  will  fail  to  converge  to  the  desired  value  if  the  channel  gain  drops 
below  this  value.  This  problem  can  be  minimized  if  we  introduce  a  gain  in  front  of  the 
equalizer.  Introducing  a  gain  at  the  equalizer  will  not  eliminate  the  problem,  but  it  will 
lower  the  critical  point  below  which  ill  convergence  appears. 
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Figure  3;  The  Anchored  DFE  using  the  CMA 
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Figure  4:  Mean  of  the  Squared  Error  of  ACMA  for  Different  Gain  g 
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APPENDIX  C:  PART  III 


BLIND  MAXIMUM  LIKELIHOOD  SEQUENCE  ESTIMATION  OF  DIGITAL 
SEQUENCES  IN  THE  PRESENCE  OF  INTERSYMBOL  INTERFERENCE 

by 

Raafat  Kamel  and  Yeheskel  Bar-Ness 

ABSTRACT 

A  blind  maximum  likelihood  (ML)  sequence  estimator  for  an  unknown  linear  dispersive 
channels  is  described.  The  estimator  assumes  a  channel  model  with  quantized  parameters. 
Two  trellises,  a  channel  trellis  and  a  data  trellis  are  defined,  to  search  for  the  ML  channel 
and  data  estimates  using  the  Viterbi  Algorithm  (VA).  This  approach  provides  a  good  per¬ 
formance  versus  complexity  tradeoff. 

I.  INTRODUCTION: 

Adaptive  blind  equalizers  are  used  to  combat  intersymbol  interference  in  situations  when 
it  is  not  appropriate  to  use  training  sequence  in  the  adaptation  of  conventional  equalizers  [2]. 
Most  of  the  research  in  blind  equalization  is  geared  towards  finding  suitable  cost  function 
for  the  linear  equalizer.  For  severely  distorted  channels,  linear  equalizers  enhances  noise, 
resulting  in  unsatisfactory  performance.  For  these  channels  blind  decision  feedback  equal¬ 
ization  and  blind  maximum  likelihood  sequence  estimation  (MLSE)  are  prefered.  In  this 
communication  we  consider  the  latter  technique.  It  is  well  known  that  MLSE  outperforms 
both  the  decision  feedback  and  linear  equalizers.  Usually  the  Viterbi  algorithm  (VA)  is  used 
to  compute  the  MLSE  efficiently  [f],  but  not  without  large  computation  complexity. 

For  the  problem  in  hand  joint  ML  estimation  of  the  unknown  channel  and  the  transmitted 
data  sequence  is  needed.  Seshadri  in  [3]  calculated  the  least  stjuare  estimate  of  channel  for 
every  possible  sequence.  The  data  sequence  that  achieves  the  lowest  least  square  error  is  then 
chosen.  The  resulting  complexity  in  this  approach  was  considerably  higher  than  that  of  the 
conventional  VA.  Ghosh  and  Weber  in  [4]  developed  an  iterative  algorithm  that  estimates 
the  channel  parameters.  An  initial  guess  of  the  unknown  channel  is  made  and  the  VA  is  used 
to  estimate  the  ML  of  a  fixed  frame  of  data.  This  frame  is  then  used  to  compute  the  least 
square  fit  of  the  channel.  The  process  is  iterated  till  the  channel  estimate  converges,  the  the 
algorithm  is  switched  to  the  conventional  mode  of  operation.  The  frame  length  needed  was 
long,  which  incerases  the  complexity. 

To  facilitate  the  proposed  approach  of  this  letter,  we  assume  a  quantized  channeT  and 
develop  two  trellises  one  for  the  channel  and  the  other  for  the  data.  The  VA  is  used  to  search 
the  two  trellis  for  the  ML  channel  and  data  sequence  estimate.  The  output  of  one  is  fed  into 
the  metric  calculator  of  the  other.  The  resulting  scheme  offers  a  considerable  reduction  in  the 

^This  is  justified  in  practice,  since  finite  precision  processors  are  used  to  implement  the  algorithm. 
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computation  complexity  compared  to  [3,4].  It  also  prevails  good  complexity /performance 
tradeoff. 

IL  CHANNEL  MODEL  AND  THE  PROPOSED  TECHNIQUE 

The  sampled  output  of  the  channel,  rk,  at  instant  k  is  given  by 

rk  =  I'kh  +  nk,  (1) 


where  R  =  [h,  h-ir  ■  ■ ,  h-L]'  and  h  =  [/lo,  hi,  - 

{hi}f=o  is  the  sampled  impulse  response  of  the  cascoded  transmit,  channel  and  receive 
filters,  assumed  slowly  time  varying,  {/*,_,}  is  the  sequence  of  transmitted  symbols,  which 
are  assumed  identically  distributed  independent  random  variables  and  is  an  additive 

white  noise  sequence  with  Gaussian  distribution.  At  each  instant  the  data  takes  one  of  the 
M  possible  levels  {±1,  ±3,  •  •  • ,  ±(M  -  1)}  with  equal  probability. 

We  first  consider  the  problem  of  estimating  a  sequence  of  N  transmitted  data  symbols 
from  a  sequence  of  channel  outputs  Vk  =  [ri,r2,  •  •  •  rj^]'  for  a  known  channel.  There  are 
equi-probable  sequences  denoted  by  {1(1),  •  •  •  ,I(M^)}.  The  ML  estimator  chooses  the  most 
likely  sequence  1ml  according  to 


Ijv/n  =  argmax/r|i(r|I),  (2) 

where  /r|i(-l-)  is  the  conditional  probability  density  function  (pdf).  Since  {rik}  are  iid  random 
variables,  one  can  write 


/r|l(rll)  =  li  fr,\li^k\lk), 

k=i 

=  n  Un  -  Ilh),  (3) 

k^l 

where  /n(-)  is  the  Gaussian  pdf.  In  principle  the  maximization  in  eq.  (2)  should  take  place 
through  exhaustive  search  over  the  sequences,  which  can  be  carried  out  efficiently  using 
the  VA  [1], 

For  the  unknown  blind  equalization  problem  at  hand,  we  consider  the  conditional  prob¬ 
ability  of  the  received  sequence  conditioned  on  both  the  transmitted  sequence  and  the 
channel  impulse  response.  Assuming  all  channel  realizations  are  equally  probable,  the  joint 
ML  estimate  for  the  transmitted  data  sequence  and  the  channel  parameters  is  given  by, 

(Imz^Iiml)  =  argmax/i.|i,h(rll,h),  (4) 

wherein  the  maximization  is  carried  over  all  the  possible  channel  realizations  and  transmitted 
data  sequences.  Such  problem  is  not  trivial  since  h  is  continuous  and  I  is  discrete. 

While  [3,4]  solved  eq.  (4)  by  finding  the  least  square  estimate  of  the  channel  and  then 
using  that  estimate  in  the  VA,  we  follow  a  different  approach.  The  key  observation  is  that 
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if  the  channel  is  discrete,  one  could  interchange  the  roles  of  data  and  channel  parameters  in 
the  VA  branch  metric.  That  is  if  the  data  is  known  one  would  search  a  channel  trellis  for 
the  ML  channel  parameters  and  vise  versa.  Therefore  we  propose  to  use  two  trellises,  one 
for  data  and  the  other  for  channel.  Two  VAs  are  used  to  search  two  trellises  in  parallel  one 
for  the  data  and  the  other  for  the  channel.  The  output  of  one  would  feed  the  other  and  the 
vice  versa.  This  joint  maximization  would  eventually  converge  to  the  estimate  given  in  eq. 

(4)- 

The  channel  parameters  are  approximated  by  discrete  values  from  the  infinite  alphabet 
{0,  ±c,  ±2c,  •  •  •},  where  c  can  be  chosen  to  be  arbitrary  small.  With  such  a  channel  alphabet 
the  corresponding  channel  trellis  will  have  an  infinite  number  of  states.  However  since  the 
channel  vector  h  does  not  vary  over  each  signaling  interval,  as  the  data  vector  I,  we  need 
not  consider  all  the  levels  of  the  possible  channel  parameters  at  a  given  instant.  In  order  to 
reduce  complexity,  we  propose  a  simple  assignment  scheme  for  the  channel  trellis.  The  next 
channel  estimate  ,  in  the  proposed  scheme  is  given  by 

for  state  0 


and 

=  /i*  ±  c  •  1„  for  state  n  =  1, 2,  •  •  • , 

where  ia  a  vector  of  length  L  are  either  zeros  or  ones.  For  the  special  case  when  1„  =  0,  the 
degenerate  state  0  results.  Clearly  the  number  of  states^  does  not  depend  on  the  parameter 
c  but  on  the  channel  memory  L. 

A  smaller  number  of  states  can  be  used  if  the  vector  1„  is  restricted  to  be  all  zero, 
except  for  the  element  at  the  n  th  location  to  unity,  is  clear  that  the  above  state  assignment 
results  in  L  +  2  states.  The  branches  emerging  from  all  the  states,  except  for  state  0,  have 
2  parallel  transitions,  one  corresponding  to  an  increment  (+c)  and  the  other  to  a  decrement 
(— c).  There  are  other  state  assignment  schemes  with  less  than  2^'^^  states,  but  the  above 
assignment  will  result  in  a  simple  trellis. 

The  algorithm  will  proceed  as  follows: 

•  1.  Start  with  initial  channel  estimate,  h.ML  =  h°- 

•  2.  Use  the  VA  to  solve  for 


Iml  =  argmax/(r|I,hML), 


with  the  branch  metric  {ru  —  Y^iLi  Klk-i^ 
•  3.  Use  the  VA  to  solve  for 


hML  =  argmax/(r|iML,hML), 

h 


with  the  branch  metric  (rk  —  hilk-ij 


^The  maximum  number  of  states  is  2^^^^ . 


161 


•  4.  Iterate  2  and  3. 

It  can  be  noticed  that  the  algorithm  achieves  the  ML  estimate  of  the  channel  by  adaptively 
incrementing  or  decrementing  the  previous  estimate.  Using  the  channel  state  table  above, 
we  change  one  channel  parameter  per  transition.  To  improve  the  speed  of  convergence  one 
can  more  states  in  the  channel  trellis,  that  allows  one  to  change  two  or  more  parameters  a 
time.  This  will  significantly  improve  the  rate  of  convergence  on  the  expense  of  complexity. 
Thus  one  can  compromise  rate  of  convergence  to  complexity. 

Another  parameter  that  affects  the  performance  is  the  step  parameter  c.  Choosing  c 
smaller  will  reduce  the  rate  of  convergence,  but  will  improve  the  error  rate.  This  point  is 
demonstrated  in  the  following  simulation  example. 

III.  SIMULATION  RESULTS 

The  algorithm  described  above  was  used  to  equalize  the  channel  (assumed  unknown) 
whose  sampled  impulse  response  is  given  by 

h{n)  =  0.405  •  6{n)  +  0.817  •  6{n  -  1)  +  0.407  •  6{n  -  2), 

where  ^(•)  is  the  Kronecker  delta  function.  For  simplicity  binary  transmission  is  assumed, 
so  that  L  =  2,  and  therefore  the  channel  and  data  trellises  will  have  4  states  each.  These 
trellises  are  shown  in  fig.  1. 

The  channel  was  initialized  to  h°  =  (0  0  0).  The  bit  error  rate  for  four  different  values 
of  c  were  determined  by  simulation.  The  bit  error  of  the  ideal  MLSE  (with  known  channel) 
was  also  determined.  These  are  depicted  in  fig.  2.  With  c  =  0.01,  the  error  rate  of  the 
proposed  algorithm  closely  follows  that  of  the  ideal  MLSE.  For  larger  values  of  c,  the  error 
rate  deviates  from  the  ideal  case.  However  for  c  =  0.05  case  there  is  less  than  1  dB  loss. 

The  rate  of  convergence  results  are  not  shown  due  to  space  limitations.  However  as  ex¬ 
pected  the  rate  of  convergence  for  the  c  =  0.01  case  was  the  lowest  among  the  four  cases. 

IV.  CONCLUSION: 

In  this  communication,  a  new  algorithm  for  blind  MLSE  was  proposed.  This  algorithm 
approximates  the  continuous  level  channel  model  by  a  discrete  level  one.  A  channel  state 
assignment  scheme  was  presented,  that  leads  to  a  simple  channel  trellis.  This  together  with 
a  data  trellis  are  used  to  find  the  joint  ML  channel  and  data  estimates. 

The  algorithm  offers  a  good  complexity  versus  performance  tradeoff.  One  can  compromise 
complexity  for  faster  convergence  rate  and  lower  error  rate.  The  rate  of  convergence  depends 
directly  on  the  parameter  c.  This  paper  considered  using  a  constant  step  c,  using  a  variable 
c  would  be  valuable  to  acheive  a  higer  rate  of  convergence.  This  point  is  currently  pursued 
by  the  authors. 
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APPENDIX  C:  PART  IV 

REDUCED-COMPLEXITY  SEQUENCE  ESTIMATIION 
USING  STATE  PARTITIONING 


by 

Raafat  Kamel  and  Yeheskel  Bar-Ness 


ABSTRACT 


A  reduced-state  sequence  estimator  for  linear  dispersive  channels  is  described.  The  tech¬ 
nique  is  based  on  partitioning  the  set  of  all  channel  states  in  a  way  that  defines  a  trellis 
with  fewer  states.  The  new  technique  generalizes  trellises  that  were  previously  obtained 
by  Duel-Hallen  and  Heegard,  and  Eyouboglu  and  Qureshi.  While  the  approach  followed 
by  Duel-Hallen  results  in  powers  of  two-state  trellises,  the  new  technique  generates  trellises 
with  an  arbitrary  number  of  states.  Using  the  state  partitioning  approach  one  could  also 
generalize  the  trellises  obtained  by  Eyouboglu.  Therefore,  the  approach  described  in  this 
paper  provides  a  better  performance/ complexity  tradeoff. 

I.  INTRODUCTION 

In  high  speed  digital  transmission  over  bandlimited  channels,  one  of  the  principal  im¬ 
pairments,  besides  additive  Gaussian  noise,  is  intersymbol  interference  (ISI).  Equalization 
is  used  to  combat  ISI.  The  type  of  equalization  used  to  mitigate  ISI  can  be  divided  into 
two  classes.  The  first,  symbol-by-symbol  equalization,  encompasses  linear  and  decision  feed¬ 
back  equalization.  The  second  involves  maximum  likelihood  sequence  estimation  (MLSE) 
[1],  where  the  Viterbi  algorithm  (VA)  is  used  to  search  a  trellis  for  the  desired  sequence. 

While  the  first  class  has  low  complexity  and  a  high  error  rate,  the  second  has  a  lower 
error  rate  at  the  expense  of  complexity.  The  complexity  of  the  VA  grows  exponentially  with 
the  length  of  the  channel  impulse  response.  When  the  impulse  response  becomes  larger,  the 
VA  becomes  impractical,  and  methods  for  complexity  reduction  are  needed. 

Research  has  been  directed  toward  obtaining  reduced-complexity  equalizers,  while  main¬ 
taining  a  performance  as  close  as  possible  to  MLSE.  To  reduce  the  complexity,  a  number  of 
authors  have  proposed  incorporating  a  linear  or  decision  feedback  preprocessor  so  that  the 
MLSE  will  deal  with  an  equivalent  channel  having  a  shorter  impulse  response  [2]  [3].  In  [2], 
a  linear  equalizer  was  used  to  shorten  the  impulse  response  of  the  channel,  while  in  [3]  a 
DEE  was  used  to  truncate  the  length  of  the  channel.  Such  approaches  were  found  to  limit 
the  performance  of  the  combined  system. 

Eyuboglu  and  Qureshi  [4],  and  Duel-Hallen  and  Heegard  [5]  have  proposed  sequence  esti¬ 
mators  which  provide  a  good  performance/ complexity  tradeoff.  The  reduced-state  sequence 
estimation  (RSSE)  of  [4]  is  useful  for  systems  utilizing  a  large  signal  constellation,  while  the 
delayed  decision-feedback  sequence  estimation  (DDFSE)  of  [5],  which  is  a  special  case  of 
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RSSE,  is  suitable  for  channels  with  a  long  impulse  response. 

In  [4],  trellises  with  reduced  number  of  states  are  constructed.  These  states  are  formed  by 
combining  the  states  of  the  maximum  likelihood  estimator  using  Ungerboek-like  set  partition¬ 
ing  principles.  An  ambiguity  arises  in  the  branch  metric  calculation  due  to  the  combination 
of  the  different  states.  A  feedback  detector  is  incorporated  to  resolve  this  ambiguity. 

In  [5]  the  complexity  of  the  VA  is  reduced  by  considering  a  few  states  of  the  channel.  The 
ISI  due  to  the  rest  of  the  states  is  estimated  using  a  feedback  detector  analogous  to  that  of 
the  decision  feedback  equalizer  (DFE).  The  estimated  ISI  is  then  used  in  the  branch  metric 
computation.  As  in  the  DEE,  error  propagation  affects  the  performance  of  the  algorithm. 
The  degradation  due  to  error  propagation  was  found  to  be  less  than  that  of  the  DFE. 

In  [6]  a  sequential  algorithm;  namely,  the  stack  algorithm,  used  for  decoding  convolutional 
codes,  is  adapted  to  ISI  channels.  A  metric  suitable  for  ISI  was  derived  in  [6].  The  stack 
algorithm  offers  an  advantage  in  terms  of  computational  complexity  over  the  VA.  This  comes 
with  an  increase  in  stack  and  buffer  storage.  Reference  [7]  gives  a  good  survey  on  different 
sequential  decoding  algorithms. 

In  this  paper,  a  new  technique  [8]  is  presented  for  reducing  the  complexity  of  the  VA 
for  channels  with  long  memory  and  an  arbitrary  size  signal  constellation.  The  technique  is 
closely  related  to  [4]  [5].  While  [4]  is  suitable  for  a  signal  alphabet  with  large  constellations 
and  [5]  is  suited  for  channels  with  long  memory,  the  new  approach  offers  a  unified  approach 
to  the  problem  of  a  large  signal  constellation  with  an  arbitrarily  long  channel.  It  also  offers 
more  flexibility  in  the  choice  between  performance  and  complexity  than  the  one  in  [5].  It 
can  generate  trellises  with  any  number  of  states  rather  than  only  power  of  M  states  as  in 
[5],  where  M  is  the  alphabet  size. 

The  approach  described  in  this  paper  is  motivated  by  the  work  on  error  propagation  for 
the  DFE  given  in  [9].  Error  sequences  for  the  DFE  can  be  modeled  as  a  Markov  chain, 
whose  number  of  states  is  exponential  in  the  number  of  distinct  magnitudes  of  error  and  the 
number  of  past  decisions  that  influence  the  current  decisions.  The  complexity  of  the  resulting 
systems  is  extremely  high.  To  reduce  system  complexity  Duttweiler,  et  al^  [9]  proposed  a 
reduced  state  machine,  wherein  error  sequences  are  grouped  together  in  a  unique  manner. 
This  grouping  can  also  be  envisioned  as  partitioning  the  set  of  all  possible  error  sequences. 
This  led  to  using  a  state  partitioning  technique  to  reduce  sequence  estimation  complexity. 

This  paper  is  organized  as  follows.  In  section  2,  we  present  the  channel  model  and 
introduce  the  partitioning  approach  for  reducing  the  complexity  of  the  VA  for  the  binary 
transmission  case.  In  section  3,  we  describe  a  general  procedure  to  perform  the  suggested 
partitioning.  An  example  is  given  in  section  4.  In  section  5,  we  discuss  the  probability  of 
error  for  the  reduced  complexity  scheme.  The  technique  is  extended  to  complex  non-binary 
modulation  in  section  6.  Conclusions  are  given  in  section  7. 

11.  CHANNEL  MODEL  AND  THE  PROPOSED  TECHNIQUE 

We  will  consider  channels  with  finite  impulse  response,  whose  discrete  time  model  is  given 
in  Figure  1.  This  model  arises  in  a  quadrature  amplitude  modulation  (QAM)  system  at  the 
output  of  a  sampled,  whitened  matched  filter.  Using  the  D  transform,  the  channel  h[D)  is 
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modeled  as  a  finite  response  filter  (FIR),  and  n{D)  is  a  white  Gaussian  noise  source  with  a 
zero  mean  and  variance  of  The  data  sequence  a{D)  consists  of  symbols  Uk,  which  are 
independent  and  identically  distributed.  In  this  section  we  will  assume  binary  transmission; 
extension  to  complex  and  non-binary  modulation  is  deferred  to  section  6.  The  data  symbols 
Gk  take  values  of  ±1  with  equal  probability.  Referring  to  Figure  1,  the  output  y{D)  is  given 


by 


y{D)=^a{D)h{D)+niD), 


where  h{D),  given  by  h{D)  =  ho  +  hD  -!-•••  +  defines  the  channel  impulse  response, 

whose  degree,  n,  is  determined  by  the  channel  memory.  The  state  of  the  channel,  s(A;),  at 
time  k  is  the  binary  n  tuple  given  by  aA:-2,  ■  •  • ,  ait-n),  tbe  previous  input  data.  At  any 

time  k  there  are  2"  possible  states.  The  set  of  all  channel  states  is  denoted  by  Q;  be.. 


Q,  =  {si  :  Si  is  a  state  of  the  system  ,  i  =  0, 1,  •  •  • ,  2"  —  1). 

In  the  proposed  technique  different  states  are  combined  into  a  smaller  number  of  super¬ 
states.  That  is,  the  set  Q  is  divided  into  iV,  Si,  subsets,  where  iV  is  2  <  A"  <  2",  such 
that 

1. Ulo^Si  =  n 

2.  Si  n  Sj  =  0;  the  empty  set,  for  i  ^  j  and  0  <  i,j  <  N  —  1 

3.  The  subsets  Si  are  chosen  such  that  for  all  G  Si,  the  corresponding  next  state 

+  1)  must  belong  to  one  subset. 

The  first  two  conditions  specify  a  partition  on  the  set  0,  and  hence  one  could  also  specify 
an  equivalence  relation  on  fl.  The  third  condition  is  a  constraint  on  the  partitions  that 
enables  a  trellis  to  be  defined.  It  can  be  shown  that  not  every  partition  on  fl  could  be  a 
candidate;  only  those  that  result  in  a  trellis  are  suitable.  A  procedure  is  devised  for  defining 
partitions  that  satisfy  the  third  condition.  This  is  detailed  in  section  3. 

At  this  point  one  should  emphasize  the  difference  between  the  partitioning  considered  in 
this  paper  and  that  in  [4].  Therefore  a  brief  description  of  the  RSSE  is  in  order. 

Consider  a  signal  constellation,  C,  with  M  points,  the  state  of  the  channel  s{k)  is  given 
by  an  M-ary  n  tuple  {ak-i,ak-2,  ■  ■  •  Afc-n)-  The  RSSE  [4]  associates  each  element  ak-i,  for 
1  <  i  <  n,  with  a  set  partition  C{i)  of  the  constellation,  in  which  the  signal  constellation 
is  partitioned  into  Ji  subsets.  The  number  of  subsets  J,-  ranges  from  2  to  M.  In  [4]  the 
partition  C{i)  is  chosen  as  a  finer  partition  of  C{i  -1-  1)  and  Ji  >  for  1  <  i  <  n  -  1. 
Using  the  constellation  partitioning  described  in  [4],  the  channel  state  s{k)  is  represented  by 
a  sub-state,  defined  by 

tk  —  [®fc— l(l)?  2(2),  ■  ■  '  ,  Xk—n(j^)]i 

where  Xk-i{i)  is  the  subset  to  which  the  transmitted  symbol  Uk-i  belongs.  Now  since  Xk-i{i) 
can  only  assume  one  of  Ji  possible  values,  the  RSSE  will  have  a  total  of  YYi=\  Ji  states  as 
opposed  to  iff”  original  states.  When  Ji  =  M  for  1  <  ilel  and  Ji  =  1  for  I  <  i  <  n,  RSSE 
becomes  DDFSE. 
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The  trellises  generated  by  the  DDFSE  represent  a  subset  of  those  obtained  by  the  RSSE 
technique.  By  using  the  state  partitioning  approach  described  in  this  paper  one  can  obtain 
more  trellis  choices.  Consider  the  special  case  of  a  binary  alphabet,  the  approaches  followed 
in  [4]  [5]  will  result  in  2™'-state  trellises,  m  ranging  from  1  to  N.  On  the  other  hand  the 
state  partitioning  approach  could  generate  trellises  with  an  arbitrary  number  of  states  not 
necessarily  a  power  of  two. 

The  branch  metric  for  the  MLSE  is  given  by  {yk  —  hiak-i  —  akY ■  Since  each  state 
in  the  reduced  trellis  is  a  union  of  two  or  more  channel  states,  an  ambiguity  will  result  in 
the  branch  metric  calculation.  That  is,  the  branch  metric  is  no  longer  uniquely  determined 
by  the  previous/present  trellis  states’  pair.  Similar  to  [4]  [5],  a  feedback  mechanism  is  in¬ 
troduced  to  resolve  this  ambiguity.  The  branch  metric  associated  with  the  reduced  trellis  is 
given  by  {yk  —  I]j=i  Kak-i  —  Sf+i  hiOLk^i  —  akf'^  where  I  <  m  is  determined  by  the  reduced 
trellis.  The  previous  state  estimate  {ak-i-i,  ■  ■  ■  ,(ik-n)  is  stored  in  the  path  history  associated 
with  the  present  state. 

III.  THE  PARTITIONING  PROCEDURE 

A  partition  P  of  a  set  fi  is  a  pairwise  disjoint  collection  of  non-empty  subsets  of  0,  whose 
union  is  0.  It  is  known  that  an  equivalence  relation  in  defines  a  partition  of  n,  and, 
conversely,  a  partition  in  0  yields  an  equivalence  relation.  Given  an  equivalence  relation  R 
in  fl,  let  R{a)  =  {a;  G  0  :  aRx}  for  each  a  €  fl.  R{a)  is  known  as  an  equivalence  class 
of  R  and  is  a  subset  of  The  collection  of  subsets,  P  =  {R{a)  :  a  €  fl},  is  a  partition 
of  fl.  A  collection  of  equivalence  relations  {Ri,  R2,  ■  ■  • ,  Rn]  is  known  as  an  equivalence 
sequence  iff  for  all  1  <  i  <  i  <  n,  and  all  x,?/  G  12  we  have  xRjy  xRiy.  That  is, 
Rn{x)  C  Rn-l{x)  C  •  •  •  C  Rl{x)  C  PL. 

For  the  channel  model  described  in  the  previous  section,  the  states  of  the  channel  are 
given  by  binary  n  tuples.  Consider  the  relation  Ri  given  by:  xR^y  iff  the  first  i  components 
of  the  n-tuples  x  and  y  are  identical,  for  any  states  x  and  ?/  G  O.  It  can  be  shown  that  Ri  is 
an  equivalence  relation.  Therefore  Ri  defines  a  parition  of  f2.  It  can  also  be  shown  that  the 
sequence  {Pi,  R2,  ■  •  ■ ,  Rn}  is  an  equivalence  sequence. 

Figure  2  shows  the  partitions  associated  with  different  Pj  and  the  corresponding  subsets. 
Label  the  subsets  corresponding  to  Ri,  with  binary  i  tuples.  We  refer  to  the  partition  formed 
by  Ri  as  level  i  partition.  It  should  be  noted  that  there  are  2’  subsets  at  level  i,  each  with 
cardinality  2"~L  The  number  of  subsets  in  P  will  determine  the  number  of  states  of  the 
trellis,  that  is  satisfy  the  third  constraint.  We  will  now  show  that  the  equivalence  sequence 
{Pi,  P2,  •  •  •  ,  P„}  can  define  partitions  that  will  result  in  a  reduced  complexity  trellis. 

In  order  to  show  that  the  partitions  associated  with  the  above  equivalence  sequence 
satisfy  the  third  constraint,  we  first  present  it  in  a  mathematical  form. 

Define  two  functions  Fi{x)  and  P_i(x)  as  the  next  state  of  the  channel,  when  the  present 
state  is  x,  and  inputs  are  1  and  —1  respectively.  That  is,  if  x  =  (ui,  02,  •  •  • ,  «„),  then 

Pi(x)  =  (l,ai,---,a„_i) 
and  P-iix)  =  (-l,ai,  •  •  •  ,a„_i). 
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To  meet  the  third  condition,  the  equivalence  relation  R  must  satisfy 


xRy  ==^  Fi{x)RFi{y)  and 

F-i{x)RF-i{y)  for  all  x  and  y  €  0.  (1) 

When  the  above  statements  are  satisfied,  the  functions  Fi{-)  and  T'_i(-)  are  said  to  be 
compatible  with  R.  Equation  (1)  can  also  be  written  as, 

for  all  X  and  y  ^  Cl  y  £  R[x)  =>  Fi{y)  £  R{Fi{x))  and 

E_i(y)£i?(E_i(x)).  (2) 


3.1  Trellises  with  2*  States 

The  partition  P  is  formed  by  considering  all  the  subsets  from  the  zth  level;  in  other  words 
all  the  equivalent  classes  of  the  equivalent  relation  J?,.  We  can  now  show  that  such  partitions 
satisfy  the  third  constraint,  f.e.,  equations  (1)  or  (2).  Consider  the  equivalence  relation  Rf, 
the  set  {Fi{y)  :  y  £  i?i(x)}  for  some  x  e  Cl  is  the  set  of  all  channel  states  that  have  the 
first  i  +  1  components  identical.  The  first  element  is  1  and  the  consecutive  i  elements  are 
identical,  since  the  previous  state  y  £  Ri{x)  .  Therefore,  one  can  write: 

{Fiiy) :  y  e  Ri{x)}  =  Ri+i{Fi{x)) 

C  RiiFi{x)). 

The  second  step  follows  from  the  definition  of  an  equivalence  sequence.  Therefore,  it  can  be 
concluded  that 


xR^y  Fi{x)Ri+iFi{y)  (3) 

Therefore  Fl{x)R^Fl{y)  for  all  x  and  j/  £  0.  (4) 

A  similar  argument  holds  for  that  is, 

xRiy  and  (5) 

F-x{x)RiF-x{y)  for  all  x  and  y  ^  Cl. 

Comparing  with  equation  (1),  we  conclude  that  the  equivalence  relation  Ri  defines  a  parti¬ 
tion  which  satisfies  the  third  condition  and  would  result  in  a  trellis.  Since  the  equivalence 
relation  Ri  partitions  the  set  Cl  into  2*  subsets,  the  resulting  trellis  will  have  2‘  states.  It 
should  be  mentioned  that  these  trellises  are  the  same  as  those  found  by  Duel-Hallen,  et  al. 
[5]  On  the  other  hand,  as  will  be  shown  in  the  next  section,  by  using  the  state  partitioning 
technique,  one  can  find  trellises  with  any  number  of  states,  rather  than  only  power  of  2. 
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3.2  Trellises  with  Number  of  States  not  2‘ 


In  this  case  the  partition  P  is  formed  by  considering  subsets  from  adjacent  levels,  instead 
one  level  as  in  the  previous  section.  This  can  be  done  by  considering  some  level  i  subsets 
and  the  remaining  subsets  are  replaced  by  their  corresponding  subsets  from  level  i  +  1.  This 
will  satisfy  the  first  two  conditions  given  in  section  2.  To  satisfy  the  third  condition  one  has 
to  meet  two  requirements: 


For  all  X  and  y  G 

xRiy 


Fi{x)R,Fi{y)  if  i?i(Ti(a;))  £  P  or 
Fi{x)Ri^iFi{y)  otherwise 


2.  For  all  x  and  y  E 

xRi+iy 


Fi{x)Ri+tFi{y)  if  R^+i{Fi{x))  G  P  or 
Fi{x)RiFi{y)  otherwise  . 


P_i(x)  has  to  satisfy  similar  requirements. 

The  requirements  given  in  (6)  and  (8)  follow  from  equation  (4).  Condition  (7)  also  follows 
from  equation  (4).  Now  since 

Fi{x)Ri^iFi{y)  Fi{x)RiFi{y)  from  definition  of  equivalence  sequence, 


xRi^iy  =>  Fi{x)Ri^iF-i{y)  from  equation  (4)  , 

it  follows  that 

xRi+iy  Fi{x)RiFi{y), 

and  (9)  is  satisfied.  A  similar  argument  holds  for  P_i(a;).  Therefore,  the  partition  P  formed 
by  considering  subsets  from  two  adjacent  levels  results  in  a  trellis.  The  number  of  states  of 
such  a  trellis  will  not  be  a  power  of  2. 

It  is  worth  mentioning  that  partitions  formed  by  subsets  from  non-adjacent  levels  will 
not  form  a  trellis  since  (7)  will  not  be  satisfied.  In  fact, 


xRiy 


is  true  only  for  j  =  i  +  1. 


IV.  AN  EXAMPLE 


F\{x)RjF\{y)  and 

P_i(x)PjP_i(y)  for  all  x  and  y  E  ft 


Consider  the  channel  given  by 


h{D)  =  ho  +  h^D  +  +  hD^. 
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The  above  channel  has  a  memory  of  n  =  3;  therefore,  the  state  can  be  represented  by  binary 
3  tuples  X  =  (a;i,X2,a:3).  One  can  use  up  to  three  levels  of  partitioning,  or  the  equivalence 
sequence  {/?i,  E2, -R3}.  Using  the  notation  in  section  3,  Table  1  gives  different  partitioning 
schemes.  Consider  the  5-state  trellis  given  in  Table  1.  The  set  of  all  states  U  is  partitioned  by 
considering  subsets  /22(0l)i  ^2(1>0)  ^nd  i22(l5l)  from  level  2  and  instead  of  subset  i?2(0?0) 

we  consider  its  subsets  i?3(000)  and  i?3(001)  from  level  3.  Note  that  there  are  ^  ^  ^ 

possible  trellises  with  5  states.  The  last  entry  in  Table  1  is  the  full  complexity  case  of  8  states. 
The  branch  metric  depends  on  the  originating  node  of  a  given  branch.  If  the  originating 
node  of  a  branch  corresponds  to  a  subset  from  level  /  (/  <  n),  then  the  branch  metric  7^  is 
given  by 

l  n 

7fc  =  iUk  hiCik-i)  ■  (1^) 

i=o 

The  trellises  for  the  partitioning  schemes  considered  in  Table  1  are  given  in  Figure  3. 

V.  PROBABILITY  OF  ERROR 

In  this  section  we  will  investigate  the  error  performance  of  the  partitioning  schemes  de¬ 
veloped  in  section  3.  It  was  noted  in  [4]  [5]  [10]  that  the  effect  of  the  error  propagation  is 
minimal  for  moderate  to  high  SNR.  Therefore,  we  will  assume  in  our  derivation  that  the 
effect  of  error  propagation  is  negligible.  In  the  sequel  we  will  consider  trellises  with  powers 
of  two  states,  from  which  other  trellises  will  be  derived. 

5.1  Trellises  with  2®  States 

These  trellises  are  the  same  as  those  derived  in  [5],  and  hence  the  analysis  given  in  [5] 
applies  here.  Nevertheless,  we  will  relate  the  probability  of  error  to  different  partitioning 
levels.  This  will  be  vital  for  the  analysis  of  trellises  with  an  arbitrary  number  of  states. 

Consider  the  trellis  formed  by  the  subsets  from  level  i.  As  noted  earlier,  the  resulting 
trellises  will  have  2®  states,  which  are  represented  by  the  binary  i  tuple.  Consider  the  channel 
at  time  k,  the  channel  state  is  s(fc);  Si{k)  =  (afc_i,  afc_2,  •  •  • ,  afc-*),  let  the  corresponding  state 
estimated  by  the  VA  be  denoted  by  Si{k),  Si{k)  =  (afc_i,afc_2,  ■  •  ■  ,dfc-,).  Following  Forney’s 
approach  [1],  an  error  event  £  is  said  to  occur  between  k  =  ki  and  k  —  ^2,  if  k{ki)  =  Si{ki)^ 
Si{k2)  =  Si{k2)  and  Si{k)  ^  Si{k)  for  all  k,  k^  <  k  <  k2.  Since  Si{k)  =  Si{k)  for  k  =  fci,  ^2,  it 
follows  that 

(dfci-i5  dfcj_2,  •  •  • ,  afcj_2,  ■  •  •  ,  Ofci-i) 

and 

(^^2— 1  5  ■  ■  ■  5  j)  — 1’  2?  '  '  '  )  ®fc2— *)• 

Now  define  the  input  error  sequence  associated  with  the  event  by 

6  —  {®fcl  1  ?  ■  '  ’  ?  } ) 
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where  Ck  =  ak  —  ak-  The  Euclidean  distance  df^S)  of  the  error  event  is  given  by 


k2  /Tmn{k-ki,i) 


k=ky  \  3=0 


In  the  case  of  binary  transmission,  the  probability  of  error  is  upper  bounded  by  [l] 


Pe  <Y.Q  12  ^(e)2" 


where  Ed^  is  the  set  of  all  error  events  having  a  Euclidean  distance  of  d?  and  D  is  the  set  of 
square  roots  of  Euclidean  distances  attained  by  error  events.  The  factor  u;(e)  is  the  number 
of  bit  errors  a  given  error  event  entails,  and  (^(•)  is  given  by 


1 


e  ^  Pdij 


For  moderate  to  high  SNR  the  upper  bound  of  the  probability  of  error  is  dominated  by 
events  attaining  the  minimum  distance,  i.e., 


where  Ki  is  given  by 


Pe  <  KiQ 


e&Ed,  . 


Note  that  we  used  the  subscript  i  throughout  to  emphasize  the  dependence  of  terms  like 
di^in  and  Ki  on  the  level  of  partitioning  i.  Therefore,  to  evaluate  the  upper  bound  on  the 
probability  of  error  for  a  given  level,  one  has  to  determine  di^^^  and  Ki. 

At  lower  SNR  one  can  get  better  bounds  by  considering  the  stack  algorithm  given  in 
[10].  However,  with  the  stack  algorithm  one  has  to  first  find  the  error  state  diagram.  The 
complexity  of  such  a  diagram  becomes  intractable  for  channels  with  a  long  impulse  response. 
Therefore,  we  will  only  consider  events  with  minimum  distances. 

5.2  Trellises  with  Number  of  States  not  2* 


We  showed  in  section  3  how  to  form  trellises  by  considering  subsets  from  adjacent  levels 
rather  than  from  one  level.  Examples  of  these  trellises  were  given  in  section  4. 

To  find  an  upper  bound  on  the  probability  of  error  for  these  trellises,  we  consider  without 
loss  of  generality  the  trellis  formed  by  subsets  from  levels  i  and  z  +  1.  Furthermore  assume 
that  the  trellises  are  formed  by  considering  p  subsets  from  level  i  and  q  from  level  z  +  1  in 
such  a  way  that  the  third  constraint  given  in  section  2  is  satisfied.  At  moderate  to  high 
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SNR,  using  the  total  probability  theorem,  the  probability  of  error  of  such  a  scheme  can  be 
upper  bounded  by 

a  <  (%- )  +  (^) .  (15) 

It  can  be  easily  shown  that  for  g  =  0  the  upper  bound  for  level  i  results,  while  for  p  =  0 
that  of  level  i  +  1  results.  Note  that  in  equation  (15)  the  first  term  dominates  asymptot¬ 
ically,  since  That  is,  at  high  SNR  the  first  term  in  equation  (15)  is  more 

dominant  than  the  second.  Therefore,  the  performance  of  such  a  trellis  would  be  the  same 
as  that  of  level  i  at  sufficiently  high  SNR.  However,  at  moderate  SNR,  the  performance  of 
these  trellises  is  better  than  those  with  2*  states,  i.e.,  trellises  formed  by  considering  subsets 
from  level  i  only.  This  point  is  demonstrated  in  the  following  example.  The  improvement  in 
performance  becomes  more  pronounced  for  longer  channels. 

5.3  Simulation  and  Upper  Bounds 

As  an  example,  consider  the  channel  whose  impulse  response  is  given  by 

h{n)  =  0.7107-(5(n)+0.1421-^(n-l)+0.2132-<5(n-2)+0.1421-^(n-3)+0.6396-(5(n-4).  (16) 

The  above  channel  has  a  memory  of  n  =  4;  therefore,  the  states  can  be  represented  by  binary 
4  tuples  X  =  {xi,X2,  X3,  X4).  One  can  use  up  to  four  levels  of  partitioning,  or  the  equivalence 
sequence  {i?i,  ^2)  .^3)  .^4}-  We  will  consider  the  trellises  formed  by  the  equivalence  relations 
i?3  and  R4.  Using  the  same  notation  as  above.  Table  2  gives  some  schemes  we  considered  in 
the  simulation  and  computation  of  the  upper  bounds. 

For  level  4,  the  error  sequences  that  have  minimum  distance  are  ±(2, 0,  0,  0, 0),  and  for 
level  3  these  sequences  are  ±(2,0, 0,0).  The  simulation  results  and  the  upper  bounds  are 
shown  Figure  4.  Referring  to  Figure  4,  there  is  less  than  a  2dB  loss  when  considering  the 
8-state  instead  of  the  16-state  trellis.  As  expected  with  12-  and  14-state  trellises  the  prob¬ 
ability  of  error  is  better  than  the  8-state.  The  amount  of  improvement  decreases  as  SNR 
increases.  At  high  SNR  the  improvement  becomes  insignificant;  that  is,  the  12-  and  14-state 
trellises  approach  the  8-state  performance. 

VI.  NON-BINARY  MODULATION 

In  this  section  the  concept  of  state  partitioning  is  extended  to  include  non-binary  mod¬ 
ulation  schemes.  The  state  s(k)  is  represented  by  an  M-ary  n  tuple  (afc_i,ai;_2,  •  •  •  ,ajt-n), 
where  is  an  M-ary  symbol  given  by 

-(M  -  1)  •  d,  •  •  • ,  -d,  d,  •  •  •  (M  -  1)  ■  d  for  M-PAM 

Xk  jVk,  where  Xk  and  yk  take  values 
-(f  -  1)  •  d,  •  ■  • ,  -d,  d,  •  ■  ■ ,  (f  -  1)  •  d  for  M-QAM 

dexp{j^i),i  =  0, 1,  •  •  •  ,M  —  1  for  MPSK, 
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where  <i  is  a  function  of  the  transmitted  power.  The  number  of  states  in  this  case  is  given 
by  M".  Thus  the  set  of  channel  states  17  has  a  cardinality  of  M”. 

Following  a  similar  procedure  as  in  section  3,  the  equivalence  sequence  {i?i,  •  •  • ,  is 
used,  where  Ri  is  again  defined  as  xRiy  iff  the  first  i  components  of  the  n-tuples  x  and  y 
are  identical  for  all  x  and  y  E  17.  The  partitioning  tree  in  this  case  will  have  M  branches 
emerging  from  each  node.  Reduced  complexity  trellises  can  be  constructed  by  considering 
subsets  or  union  of  subsets  from  different  levels.  This  is  best  illustrated  by  the  following 
example. 

Consider  the  transmission  of  a  4-QAM  source  (Figure  5)  through  a  channel  whose  impulse 
response  is  given  by 


h{D)  =  ho  +  hD  +  h2D^  +  hsD^. 

In  this  case  the  channel  has  4^  =  64  states.  Since  the  memory  of  the  channel  is  3,  the  parti¬ 
tion  tree  can  have  up  to  3  levels.  The  partitioning  tree  is  shown  in  Figure  6.  Trellises  with 
different  numbers  of  states  can  be  constructed  using  the  subsets  shown  in  Figure  6.  Table  3 
lists  some  of  these  trellises.  Table  3  also  indicates  an  RSSE-type  trellis  with  the  same  number 
of  states,  if  they  exist.  Those  given  in  the  table  were  arbitrarily  chosen.  Since  the  channel 
has  a  memory  of  three,  for  the  RSSE  we  need  to  specify  three  parameters  (Ji,  J2,  J3),  which 
are  given  in  Table  3.  As  with  the  state  partitioning  technique  there  could  be  more  than  an 
RSSE-type  trellis  for  a  given  number  of  states,  an  arbitrary  one  is  given  in  the  table.  Eor 
illustration  purposes  consider  the  8-state  case.  In  Eigure  7a  we  present  the  trellis  obtained 
using  the  state  partitioning  approach,  which  is  the  same  the  RSSE  trellis  [4]  which  is  de¬ 
picted  in  Eigure  7b.  Eor  the  trellis  shown  in  Eigure  7b,  we  used  Ji  =  4,  J2  =  2  and  J3  =  1. 
It  can  be  seen  from  Table  3  that  the  state  partitioning  approach  results  in  a  wider  range  of 
trellises  than  with  the  RSSE.  As  a  result  it  offers  a  better  performance/complexity  tradeoff 
compared  to  the  RSSE.  No  attempt  has  been  made  to  compare  the  distance  properties  of 
these  trellises,  since  we  just  intended  to  show  the  existence  of  a  wider  range  of  trellises,  some 
of  which  could  not  be  produced  by  the  RSSE  [4]  approach. 

VII.  CONCLUSION 

In  this  paper  we  introduced  a  new  approach  to  reduce  the  complexity  of  the  VA.  This 
approach  is  based  on  partitioning  the  set  of  channel  states.  It  also  a  offers  good  complexity 
versus  performance  tradeoff.  It  was  shown  that  the  trellises  obtained  in  [5]  and  [4]  are  special 
cases  of  those  described  in  this  paper. 

The  state  partitioning  approach  results  in  trellises  with  an  arbitrary  number  of  states 
which  are  not  necessarily  powers  of  2  states,  as  in  [5]  for  a  binary  signal  set.  Depending  on 
the  length  of  the  channel  and  the  operating  SNR,  trellises  with  non-power  of  2  states  can 
attain  a  lower  probability  of  error  than  the  powers  of  2.  However,  at  high  SNR  the  improve¬ 
ment  of  the  former  over  the  latter  is  insignificant.  The  technique  can  also  be  extended  to 
include  complex  and  non-binary  signal  sets. 
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Figure  1:  The  discrete  channel  model 


a 


Figure  2:  The  partitioning  tree  for  a  binary  source 


Figure  3:  Trellises  for  the  schemes  given  in  Table  1 
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Figure  4:  Probability  of  error  for  different  partitioning  schemes 


Figure  5:  Constellation  for  4-QAM. 
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Level  1 


Level  2 


Level  3 


Figure  6:  Partitioning  tree  for  a  channel  with  a  memory  of  3  and  a  4-QAM  source. 


[0,0,0] 


R|a,a)  U  R^(a,c) 
R2(a,b)  U  R^CM) 


R|b,a)  U  R2(b,c) 
Rp,b)  U  R2(b,d) 
R|c,a)  U  I^(c,c) 
R2(c,b)  U  R^(c,d) 
R^d,a)  U  R^(d,c) 
R|d,b)  U  R2(d,d) 


Figure  7:  (a)  The  8-state  trellis  obtained  using  the  state  partitioning  approach,  (b)  The 

8-state  trellis  obtained  using  RSSE  [4], 


Number  of  States 

Partitions 

2 

{i7i(0),Ri(l)} 

3 

{Ri(0),  772(10),  R2(11)} 

4 

{772(00),  772(01),  i?2(10),  772(11)} 

5 

{773(000),  773(001),  772(01) 

772(10),  772(11)} 

6 

{773(000),  773(001),  772(01) 

772(10),  773(110),  773(111)} 

7 

{7?3(000),  773(001),  772(01) 

773(100),  773(101),  773(110),  773(111)} 

8 

{773(000),  773(001),  773(010),  773(011) 
773(100),  ^3(101),  773(110),  773(111)} 

Table  1:  Different  partitions  for  a  channel  with  a  memory  of  n  =  3. 
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Number  of  States 

Partitions 

8 

{J?3(000),  7^3(001),  E3(010),  i?3(011) 
J?3(100),  i?3(101),  723(110),  723(111)} 

12 

{723(000),  723(001) 

724(0100),  724(0101),  724(0110),  724(0111) 
724(1000),  724(1001),  724(1010),  724(1011) 
723(110),  723(111)} 

14 

{723(000),  724(0010),  724(0011) 

724(0100),  724(0101),  724(0110),  724(0111) 
724(1000),  724(1001),  724(1010),  724(1011) 
724(1100),  724(1101),  723(111)} 

16 

{724(0000),  724(0001),  724(0010),  724(0011) 
724(0100),  724(0101),  724(0110),  724(0111) 
724(1000),  724(1001),  724(1010),  724(1011) 
724(1100),  724(1101),  724(1110),  724(1111)} 

Table  2:  Selected  partitions  for  a  channel  with  a  memory  of  n  =  4 
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^  of  States 

Partitions 

RSSE  iJi,J2,Js) 

2 

{Ri{a)  U  i?i(c),  Ri{b)  U  Ri{d)} 

(2,1,1) 

3 

{Rila),Ri{b)  U 

(3,1,1) 

4 

{R^{a),Rr{b),  R,{c),  Rr{d)} 

(2,2,1) 

5 

{i?2(a, «),  ^2(«,  b),  R{a,  c),  R{a,  d) 

Ri{b)  U  i?i(c)  U  Ri{d)} 

does  not  exist 
for  RSSE 

6 

{i?2(a,  o),  R2{a,  6),  R{a,  c),  R{a,  d) 
Ui?i(d), i?a(c)} 

(3,2,1) 

7 

{i?2(a,  a),  R2{o^^  b),  R{a,  c),  R{a,  d) 

does  not  exist 
for  RSSE 

8 

{i?2(a,  a)  U  R2{a,  c),  i?2(«,  b)  U  i?2(a,  d) 
R2{b,a)  U  i?2(&,  c),  R2{b,b)  U  R2{b,d) 
i?2(c,  a)  U  i?2(c,  c),  R2{c,  b)  U  i?2(c,  d) 
R2{d,  a)  U  R2{d,  c),  i?2(cf,  &)  U  i?2(c^,  <^)} 

(4,2,1) 

10 

{i?2(a,  a),  .^2(0, 6),  ^2(«,  c),  i72{a,  d) 
i72(c,  6),  i?2(c,  6),  i?2(c,  c),  i?2(c,  d) 
Ri{b),R^{d)} 

does  not  exist 
for  RSSE 

12 

{i?2(a,  0),  i22(«,  b),  R2{a,  c),  i?2(a,  d) 
R2{b,a)  U  R2{b,b),  J?2(6,  c)  U  i?2(^,  d) 
i?2(c,  a),  i22(c,  6),  i?2(c,  c),  i?2(c,  d) 
R2{d,a)  U  i?2(d,  6),  i?2(d,  c)  U  i72{d,  d)} 

(4,3,1) 

14 

{i?2(o,  a),  d?2(«,  &),  d?2(a,  c),  i?2(ci,  d) 
i?2(^,  a),R2{b,  b),  i?2(&,  c),  d?2(&,  d) 
i?2(c,a),i?2(c,6),  R2{c,c),R2{c,  d) 
i?2(d,a)  U  jR2(d,c),  i22(d,  6)  U  i?2(d,  d)} 

does  not  exist 
for  RSSE 

16 

{i?2(«, «),  d?2{«,  6),  di2(a,  c),  i?2(a,  d) 
R2{b,a),R2{b,b),  i?2(&,  c),  i?2(^  d) 
i22(c,  a),  i?2(c,  6),  i?2(c,  c),  i?2(c,  d) 
i?2(d,  a),  i?2(d,  b),  R2{d,  c),  i?2(d,  d)} 

(4.4.1) 

64 

{i?3(a,  a,  a),  •  •  • ,  i?3(d,  d,  d)} 

(4,4,4) 

Table  3:  Different  partitions  for  a  channel  with  a  memory  of  n=3  and  a  4-QAM  source. 
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